hypoelastic examples cases fix #152
|
Not sure why this failed. Based on the case it failed on, it might have to do with #151? @lee-hyeoksu |
|
It fails when doing the silo check for NaNs. The CI jobs that don't check for NaNs in the silo file all pass |
|
I am trying to reproduce the NaN with my local MFC build, but it does not seem to appear in the test. Could you let me know how Test Suite (ubuntu-latest, --no-debug, false) and Test Suite (ubuntu-latest, --no-debug, true, source /opt/intel/oneapi/setvars.sh) differ? Is it simply that the former does not check for NaNs? @wilfonba |
|
I believe the (--no-debug, true, source /opt/intel/oneapi/setvars.sh) job uses the Intel compilers. It must be compiler related, since the GNU and NVHPC tests work fine. @anshgupta1234 added the Intel compilers before these cases were added, but it's still weird that they all passed when the PR was merged. |
|
Yeah, I don't know much about what that file does. What's weird is that the cases were fine for your PR, and this PR did nothing but change two example case files, and it's suddenly not fine. It doesn't make much sense. Maybe @henryleberre has an idea if @anshgupta1234 is busy? |
|
It is very rare, but compilers sometimes produce random errors. Just to check, how about running the test again? @wilfonba |
|
It's failed on the CI for my other pull request, on this case or a similar one, several times, but only with the Intel compilers. |
|
I see, then that would not be the case. |
|
@lee-hyeoksu @wilfonba I just reran the CIs, but it looks like this is an actual problem and possibly has to do with Intel compilers. @lee-hyeoksu can you try loading the latest Intel compilers on Bridges2 or some other computer and then making sure it passes the tests? |
|
This is now passing @wilfonba. I am not sure what the issue was. |
|
No idea. Maybe an issue with whatever hardware/software GitHub is using for CI. |
|
I am currently setting up my access to Bridges2. I believe I will be able to test on it in a few days. The CI has now passed, but I will still test this on Bridges2 and let you know if there are issues. |
|
@lee-hyeoksu I re-ran the CI on the PR I merged from you on viscous + bubbles last week, and it actually failed at the Intel CPU test on your case: https://github.com/MFlowCode/MFC/actions/runs/5020414923 . I'm wondering if this is a sporadic bug that is reproducible on a non-GitHub machine with Intel compilers. Since it is only happening for your PR, I suspect it has something to do with the code you added there. |
|
That error is thrown on step 51 when MFC tries to save a binary file (or whatever it is). This occurs before any silo or hdf5 routines are called, I believe, so this would have to be something wrong with MFC + Intel compilers. Do the serial output files in D/ have NaNs? |
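One way to answer the question about NaNs in the serial output is to scan the files directly. The snippet below is only a rough sketch: it assumes a file can be read as a flat run of little-endian 64-bit floats, which may not match the actual layout MFC writes (for example, Fortran record markers are not handled), and the function names are made up for illustration.

```python
import math
import struct


def count_nans(raw: bytes) -> int:
    """Count NaNs in a buffer read as little-endian float64 values.

    Assumption: the buffer is a flat array of doubles with no headers
    or record markers. Adjust if the real file layout differs.
    """
    usable = len(raw) - (len(raw) % 8)  # ignore a trailing partial value
    return sum(
        1 for (value,) in struct.iter_unpack("<d", raw[:usable])
        if math.isnan(value)
    )


def count_nans_in_file(path: str) -> int:
    """Hypothetical helper: read a whole file and count its NaNs."""
    with open(path, "rb") as handle:
        return count_nans(handle.read())
```

Running `count_nans_in_file` over each output file of a failing case would show whether the NaNs are already present before any silo or hdf5 routine touches the data.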
|
Interesting. @lee-hyeoksu what happens if you use an |
|
@wilfonba I checked the serial outputs in |
|
I tried to figure out what happens in test 555332334 and I found that the NaNs actually occur at the first time step when populating buffers in In this subroutine, global parameters
I am still not sure why this happens with the Intel compilers, so I will keep looking into this issue, but I wanted to share what I have found so far. Any suggestions would be appreciated! |
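Since the error surfaces at step 51 while the NaNs seem to appear at the first time step, it may help to locate the earliest step that contains a NaN. This is a minimal sketch, assuming the per-step values have already been loaded into plain Python sequences; `first_nan_step` is a hypothetical helper, not part of MFC.

```python
import math
from typing import Iterable, Optional, Sequence


def first_nan_step(snapshots: Iterable[Sequence[float]]) -> Optional[int]:
    """Return the index of the first snapshot containing a NaN, or None.

    Assumption: `snapshots` yields one sequence of floats per time step,
    in time order, starting at step 0.
    """
    for step, values in enumerate(snapshots):
        if any(math.isnan(v) for v in values):
            return step
    return None
```

If this reports step 0 for the buffer data, the step-51 failure is more likely a late symptom than the origin of the problem.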
|
I can't get any of the Intel compilers to work on Bridges2... any ideas anyone? @henryleberre @anshgupta1234 |
|
I tried the Intel compiler with optimization level O0 (no optimization) by adding a line of code Also, all tests pass with optimization level O1. @wilfonba Could you try this change on your PS? |
|
@lee-hyeoksu Thanks, this is helpful to know. I have moved discussion of this problem to an issue: #156 . I may create a separate issue if Update: |

Hypoelastic test cases wouldn't run because case_dir was removed from the code
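As an illustration of the kind of change this implies for a case file, here is a hedged sketch that drops parameters the code no longer accepts before the case dictionary is printed. It assumes a case file builds a dictionary of parameters and prints it as JSON; apart from case_dir, the keys and values shown are placeholders, and `clean_case` is a hypothetical helper rather than anything in MFC.

```python
import json

# Parameters no longer accepted by the code. Only case_dir comes from
# this PR's description; extend the tuple if other keys are removed.
REMOVED_KEYS = ("case_dir",)


def clean_case(case: dict) -> dict:
    """Return a copy of the case dictionary without removed parameters."""
    return {key: value for key, value in case.items() if key not in REMOVED_KEYS}


if __name__ == "__main__":
    # Placeholder case dictionary, for illustration only.
    example = {"case_dir": "'.'", "run_time_info": "T", "m": 99}
    print(json.dumps(clean_case(example)))
```

In practice the simpler fix is to delete the case_dir entry from each example case file, which is what this PR describes; the helper only shows the idea in a testable form.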