[Q&A] Upgrading fluent-package 5.0.7 to 6.0.0, hangs on startup with rebuilding weight array lost_weight=0
#5101
Replies: 4 comments 6 replies
-
Hmm, I can't reproduce your issue... If you can introduce https://github.com/fluent-plugins-nursery/fluent-stackprof, If we could see that dump file, we might be able to figure something out... |
Beta Was this translation helpful? Give feedback.
-
Thanks for your report! Does this mean the logs only show up to here?
The fact that logs from other Input plugins, like the following, are not appearing suggests that it is highly likely the process is hanging somewhere during the initialization of
Having minimal reproduction steps would be really helpful. |
Beta Was this translation helpful? Give feedback.
-
I'm still trying to narrow this down - I don't think the actual problem is in the above any more. The main config includes several other files, including the one I snipped above. One of those others takes some incoming events (JSON, over TCP on an internal port), runs them through a When I remove that file, instead of hanging at this spot, fluentd starts up normally. So the real issue is somewhere over there, in all likelihood. I'm still testing, but didn't want anyone else to think I'd just dropped this entirely. |
Beta Was this translation helpful? Give feedback.
-
Upgrading My apologies for wasting everyone's time. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
What is the problem?
I'm trying to bump a test server from fluent-package 5.0.7 to 6.0.0. The only change I've made to the config as part of the version jump, is the include change I discovered was needed in another Q&A. That change shouldn't directly impact this, though.
Starting up fluentd via systemctl times out after a couple minutes. Attempting to start fluentd manually, the startup hangs at
rebuilding weight array lost_weight=0
, the fluentd process consumes 100% CPU, and fluentd never processes any log events. In either case, akill -9
is needed to stop the running fluentd process.Describe the configuration of Fluentd
This actually is a follow-up to #5100 . This is one of the files I'd omitted from the configs there :) This part of the config is fairly standalone, though.
The reason I'm doing a
copy
, with only one action below it, is because this config is generated by Ansible, and some of the hosts we manage do have other<store>
clauses included. (I wouldn't expect a copy, with only one destination, to be inherently problematic. Again, this works with fluent-package 5.x.)I did try removing the "weight" value from the server clause, this didn't help. Actual server IP and security credentials removed from below, of course.
Describe the logs of Fluentd
From starting fluentd manually on the CLI with
sudo /opt/fluent/bin/fluentd --verbose
:(and then it just sits there eating up CPU, and ultimately requires a
kill -9
to stopFrom the above, here's a
ps
of that startup attempt, after about an hour:So, after about an hour, the fluentd process has taken about an hour's worth of CPU. Similar logs and behavior seen when trying to start fluentd via systemctl.
Environment
Beta Was this translation helpful? Give feedback.
All reactions