-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Arista][T2] Seeing ports linkDown when PMON startup isn't delayed #19892
Comments
does it relate to port speed change (handled in voq system as port del and then port add)? |
Root Cause Analysis:
In parallel, with
This error causes the port event to be removed from
This removal of the port from the TLDR: Solution: Follow up questions: |
Issue is fixed with updating media_setting.json. Subsequent PR might be needed if their is bad input in |
#19657 recently merged to not delay the startup of
pmon
onSpineRouter
(T2) systems.This has resulted in a subset of ports to stay link down on our multi-asic LCs:
The set of ports which are down varies from boot to boot.
One thing that I noticed is that there are attributes missing from the ASIC_DB of the ports which are linkDown.
For example,
Ethernet240 (oid:0x10100000000000d)
was linkUp and had the following ASIC_DB attributes:However
Ethernet144 (oid:0x101000000000001)
was linkDown and had the following ASIC_DB attributes:Note that
'SAI_PORT_ATTR_FEC_MODE': 'SAI_PORT_FEC_MODE_RS'
is missing fromEthernet144
.But I see that the
APPL_DB
of bothEthernet144
andEthernet240
are identical, suggestingorchagent
didn't make propagate the state fromAPPL_DB
toASIC_DB
properly forEthernet144
.If I remove and re-add this
fec
attribute fromEthernet144
'sCONFIG_DB
entry I see it reflected inASIC_DB
and the port then comes up:The text was updated successfully, but these errors were encountered: