[Question] Specified group generation id is not valid


Describe the bug: Consumers sometimes encounter the following exception: Specified group generation id is not valid. The consumers that encounter this error become a kind of zombie: they are still connected as consumers to a partition but are not consuming messages.

To Reproduce: Can’t reproduce.

Expected behavior: The consumer should reconnect to the consumer group.

Observed behavior Logs:

[ConsumerGroup] Consumer has joined the group
The group is rebalancing, so a rejoin is needed
Specified group generation id is not valid

Environment:

OS: [alpine]
KafkaJS version [1.15.0]
Kafka version [2.4.1]
NodeJS version [14]

Additional context: It’s probably not related to KafkaJS, but that error is mentioned in error.js, so maybe you have an idea why it’s happening?

Issue Analytics

State:
Created 3 years ago
Comments: 8 (1 by maintainers)

Top GitHub Comments

1 reaction

Nevon commented, Jan 28, 2021

The groupGenerationId is something we get from the broker (generation_id) in the JoinGroup response, and it’s just a number that increments with each generation in the group.

You’ll get this error when you try to commit after having been kicked out of the consumer group. This can happen if you spend too long between heartbeats, for example because processing a single message takes too long. What should happen is that you re-join the group and get the new generation id to use.
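
To make the mechanism above concrete, here is a minimal toy model of it. This is only a sketch: `ToyCoordinator` and its methods are hypothetical names invented for illustration, not KafkaJS or broker code, and the real protocol has more moving parts (rebalance states, member ids issued by the broker, and so on). It only shows why a commit carrying an old generation id gets rejected, and why re-joining fixes it.

```typescript
// Toy model of a group coordinator. Hypothetical, for illustration only;
// this is NOT how KafkaJS or the Kafka broker is implemented.
type CommitResult = "ok" | "illegal generation";

class ToyCoordinator {
  private generationId = 0;
  private members = new Set<string>();

  // A new member joining starts a new generation. A member that is
  // already in the group simply gets the current generation id back.
  join(memberId: string): number {
    if (!this.members.has(memberId)) {
      this.members.add(memberId);
      this.generationId += 1;
    }
    return this.generationId;
  }

  // Simulates a member missing its heartbeat deadline: it is removed
  // from the group, which also starts a new generation.
  expire(memberId: string): void {
    if (this.members.delete(memberId)) {
      this.generationId += 1;
    }
  }

  // A commit is accepted only from a current member that presents the
  // current generation id.
  commit(memberId: string, generationId: number): CommitResult {
    if (!this.members.has(memberId) || generationId !== this.generationId) {
      return "illegal generation";
    }
    return "ok";
  }
}

const coordinator = new ToyCoordinator();
const memberId = "consumer-1"; // hypothetical member id

let generation = coordinator.join(memberId);
const firstCommit = coordinator.commit(memberId, generation);

// The consumer spends too long on one message and is kicked out.
coordinator.expire(memberId);
const staleCommit = coordinator.commit(memberId, generation);

// Re-joining yields the new generation id, and commits work again.
generation = coordinator.join(memberId);
const retriedCommit = coordinator.commit(memberId, generation);

console.log(firstCommit, staleCommit, retriedCommit);
```

In this model the second commit fails because the consumer is still holding the generation id from before it was removed. That is the situation the error message describes.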

It would be helpful if you could run with DEBUG log level so that we can see what requests are being made when this happens.

“The consumers that encounters that error become a kind of Zombie. They are still connected as consumers to a partition but not consuming messages.”

This is a shot in the dark, but do you maybe have more consumer instances than you do partitions? If so, some of your consumers will not be assigned any partitions, and thus won’t be doing any work.
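
As a rough illustration of that last point, the sketch below spreads partitions over consumers round-robin and lists the consumers that end up with nothing. `assignRoundRobin` is a hypothetical helper written for this example, not the assigner KafkaJS actually runs, and the consumer names and partition count are made up.

```typescript
// Hypothetical helper: hands out partitions 0..partitionCount-1 to the
// given consumers in round-robin order. Illustration only.
function assignRoundRobin(
  consumers: string[],
  partitionCount: number
): Map<string, number[]> {
  const assignment = new Map<string, number[]>();
  for (const consumer of consumers) {
    assignment.set(consumer, []);
  }
  if (consumers.length === 0) {
    return assignment;
  }
  for (let partition = 0; partition < partitionCount; partition++) {
    const owner = consumers[partition % consumers.length];
    assignment.get(owner)!.push(partition);
  }
  return assignment;
}

// Four consumers in the group, but the topic has only three partitions.
const assignment = assignRoundRobin(["c1", "c2", "c3", "c4"], 3);

// Consumers with an empty assignment stay connected but do no work.
const idleConsumers = Array.from(assignment.entries())
  .filter(([, partitions]) => partitions.length === 0)
  .map(([consumer]) => consumer);

console.log(idleConsumers); // [ 'c4' ]
```

With more consumers than partitions, at least one consumer gets an empty assignment. From the outside it looks connected but idle.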

0 reactions

guiestimoneon commented, Dec 23, 2022

Hello guys

I am having this issue when I scale my application horizontally. The pod is processing normally and out of nowhere I get this error: