Program hangs on dotnet9 rc2 #109172

Closed
ares128 opened this issue Oct 24, 2024 · 8 comments

Contributor

ares128 commented Oct 24, 2024

Description

My program hangs randomly on .NET 9 RC2, about once a day. Even Ctrl-C cannot terminate the program when this happens.
The same program has been running fine for months on .NET 8.

Here is the dotnet version:
dotnet --version
9.0.100-rc.2.24474.11

Linux version (uname -a):
Linux ip-172-31-19-202 6.8.0-1017-aws #18~22.04.1-Ubuntu SMP Thu Oct 3 20:15:06 UTC 2024 aarch64 aarch64 aarch64 GNU/Linux
The machine is an AWS c7g.8xlarge instance.

Here is the backtrace from the core dump:
(gdb) bt
#0 __futex_abstimed_wait_common64 (private=0, cancel=true, abstime=0x0, op=393, expected=0,
futex_word=0xb527a50a0d2c) at ./nptl/futex-internal.c:57
#1 __futex_abstimed_wait_common (cancel=true, private=0, abstime=0x0, clockid=0, expected=0,
futex_word=0xb527a50a0d2c) at ./nptl/futex-internal.c:87
#2 __GI___futex_abstimed_wait_cancelable64 (futex_word=futex_word@entry=0xb527a50a0d2c,
expected=expected@entry=0, clockid=clockid@entry=0, abstime=abstime@entry=0x0,
private=private@entry=0) at ./nptl/futex-internal.c:139
#3 0x0000f7c18c8cc8fc in __pthread_cond_wait_common (abstime=0x0, clockid=0,
mutex=0xb527a50a0cd0, cond=0xb527a50a0d00) at ./nptl/pthread_cond_wait.c:503
#4 ___pthread_cond_wait (cond=0xb527a50a0d00, mutex=0xb527a50a0cd0)
at ./nptl/pthread_cond_wait.c:627
#5 0x0000f7c18c62f988 in ?? ()
from /usr/share/dotnet/shared/Microsoft.NETCore.App/9.0.0-rc.2.24473.5/libcoreclr.so
#6 0x0000f7c18c62f6dc in ?? ()
from /usr/share/dotnet/shared/Microsoft.NETCore.App/9.0.0-rc.2.24473.5/libcoreclr.so
#7 0x0000f7c18c633e74 in ?? ()
from /usr/share/dotnet/shared/Microsoft.NETCore.App/9.0.0-rc.2.24473.5/libcoreclr.so
#8 0x0000f7c18c2dd548 in ?? ()
from /usr/share/dotnet/shared/Microsoft.NETCore.App/9.0.0-rc.2.24473.5/libcoreclr.so
#9 0x0000f7c18c2d8258 in ?? ()
from /usr/share/dotnet/shared/Microsoft.NETCore.App/9.0.0-rc.2.24473.5/libcoreclr.so
#10 0x0000f7c18c2de488 in ?? ()
from /usr/share/dotnet/shared/Microsoft.NETCore.App/9.0.0-rc.2.24473.5/libcoreclr.so
#11 0x0000f7c18c2d6810 in ?? ()
from /usr/share/dotnet/shared/Microsoft.NETCore.App/9.0.0-rc.2.24473.5/libcoreclr.so
#12 0x0000f7c18c5bf548 in ?? ()
from /usr/share/dotnet/shared/Microsoft.NETCore.App/9.0.0-rc.2.24473.5/libcoreclr.so
#13 0x0000f7c184301b2c in ?? ()
#14 0x84d00a98ffffff68 in ?? ()

Attached is a core dump taken with gcore:
https://1drv.ms/u/c/6cfa9557e62383f8/EQ_vrxnyNJVCrRXYguj-FMQB3Qn4HkX42v0t2FDb-Akibg

Reproduction Steps

None

Expected behavior

None

Actual behavior

hangs

Regression?

No response

Known Workarounds

No response

Configuration

No response

Other information

No response

dotnet-issue-labeler bot added the needs-area-label label Oct 24, 2024
dotnet-policy-service bot added the untriaged label Oct 24, 2024
Contributor Author

ares128 commented Oct 24, 2024

The core file is about 114 MB. Where can I upload it?

huoyaoyuan added the area-VM-coreclr label and removed the needs-area-label label Oct 24, 2024
Contributor

Tagging subscribers to this area: @mangod9
See info in area-owners.md if you want to be subscribed.

Member

mangod9 commented Oct 24, 2024

Is there a simple repro you can provide for the issue? You can upload a dump by filing an issue here: https://developercommunity.visualstudio.com/dotnet/report.

Contributor Author

ares128 commented Oct 24, 2024

Member

mangod9 commented Oct 24, 2024

Thanks for sharing the dump. This issue should be fixed with #108774 which is in the RTM build.

@Maoni0 FYI.

mangod9 removed the untriaged label Oct 24, 2024
mangod9 added this to the 9.0.0 milestone Oct 24, 2024
Contributor Author

ares128 commented Dec 9, 2024

@mangod9 The RTM build does not fully fix this, although the hang now occurs less often.
It is probably the same issue as #110350.
I am trying to capture a core dump to share.

Member

mangod9 commented Dec 9, 2024

Yeah, there was another issue that we uncovered in RTM, which is fixed by #110174. You can also try the workaround of disabling BGC (background GC) and check whether that helps.
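
For readers who want to try the BGC workaround mentioned above, here is a minimal sketch. It assumes the app is started from the same shell session, and `MyApp.dll` is only a placeholder name, not something from this thread. The `DOTNET_gcConcurrent` environment variable is the documented runtime switch for concurrent (background) garbage collection.

```shell
# Assumption: the application is launched from this shell afterwards.
# Setting DOTNET_gcConcurrent to 0 disables concurrent (background) GC
# for .NET processes that inherit this environment.
export DOTNET_gcConcurrent=0

# Then start the app as usual, for example (placeholder name):
# dotnet MyApp.dll
```

The same setting can also be made persistent with `"System.GC.Concurrent": false` under `configProperties` in `runtimeconfig.json`, or with `<ConcurrentGarbageCollection>false</ConcurrentGarbageCollection>` in the project file. Disabling background GC may lengthen gen2 pause times, so it is best treated as a temporary measure until a build containing the fix is installed.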

ares128 closed this as completed Jan 25, 2025
Member

mangod9 commented Jan 25, 2025

Thanks for confirming.

github-actions bot locked and limited conversation to collaborators Feb 24, 2025