Hi Timo, or to whom it may concern,
Your email awakes me in paying attention to details I did not thought about. Helpful.
When my simulations stall, the running time posting is this:
– This is ASPECT, the Advanced Solver for Problems in Earth’s ConvecTion.
– . version 2.0.1 (aspect-2.0, 2863594)
– . using deal.II 9.0.0
– . using Trilinos 12.10.1
– . using p4est 2.0.0
– . running in DEBUG mode
– . running with 1088 MPI processes
– How to cite ASPECT: https://aspect.geodynamics.org/cite.html
*** Resuming from snapshot!
Number of active cells: 124,153 (on 9 levels)
Number of degrees of freedom: 6,034,421 (3,526,572+156,801+1,175,524+1,175,524)
*** Timestep 4080: t=4.74463e+15 seconds
Solving temperature system… 3 iterations.
Solving C_1 system … 14 iterations.
Rebuilding Stokes preconditioner…
Solving Stokes system… 57+0 iterations.
Postprocessing:
%%%%%%%%%%%%%%%%%%%%%%%%%%
So it stalls at postprocessing.
These are my output settings:
subsection Postprocess
set List of postprocessors = velocity statistics, temperature statistics, heat flux statistics, visualization, composition statistics, particles
subsection Visualization
set List of output variables = density, viscosity # dynamic topography
set Time between graphical output = 3.15576e13
set Output format = vtu
set Number of grouped files = 1
I was also using particles, but I have disabled that postprocess to clean up the issue.
What I did then, regarding your questions, was to set different values for grouped files, I tried 8 and 0 (no grouping)
In both cases it failed, but at least it gave some more info:
Number of active cells: 124,153 (on 9 levels)
Number of degrees of freedom: 6,034,421 (3,526,572+156,801+1,175,524+1,175,524)
*** Timestep 4070: t=4.74214e+15 seconds
Solving temperature system… 3 iterations.
Solving C_1 system … 14 iterations.
Rebuilding Stokes preconditioner…
Solving Stokes system… 83+0 iterations.
Postprocessing:
An error occurred in line <5777> of file </work/04020/unfelipe/stampede2/software/candi/install/tmp/unpack/deal.II-v9.0.0/source/base/data_out_base.cc> in function
void dealii::DataOutBase::write_visit_record(std::ostream&, const std::vector<std::pair<double, std::vector<std::__cxx11::basic_string > > >&)
The violated condition was:
domain->second.size() == nblocks
Additional information:
piece_names should be a vector of equal sized vectors.
Stacktrace:
#0 /work/04020/unfelipe/stampede2/software/candi/install/deal.II-v9.0.0/lib/libdeal_II.g.so.9.0.0: dealii::DataOutBase::write_visit_record(std::ostream&, std::vector<std::pair<double, std::vector<std::__cxx11::basic_string<char, std::char_traits, std::allocator >, std::allocator<std::__cxx11::basic_string<char, std::char_traits, std::allocator > > > >, std::allocator<std::pair<double, std::vector<std::__cxx11::basic_string<char, std::char_traits, std::allocator >, std::allocator<std::__cxx11::basic_string<char, std::char_traits, std::allocator > > > > > > const&)
#1 /scratch/04020/unfelipe/software/candi/aspect/aspect: aspect::Postprocess::Visualization<3>::write_master_files(dealii::DataOut<3, dealii::DoFHandler<3, 3> > const&, std::__cxx11::basic_string<char, std::char_traits, std::allocator > const&, std::vector<std::__cxx11::basic_string<char, std::char_traits, std::allocator >, std::allocator<std::__cxx11::basic_string<char, std::char_traits, std::allocator > > > const&)
#2 /scratch/04020/unfelipe/software/candi/aspect/aspect: aspect::Postprocess::Visualization<3>::executeabi:cxx11
#3 /scratch/04020/unfelipe/software/candi/aspect/aspect: aspect::Postprocess::Manager<3>::executeabi:cxx11
#4 /scratch/04020/unfelipe/software/candi/aspect/aspect: aspect::Simulator<3>::postprocess()
#5 /scratch/04020/unfelipe/software/candi/aspect/aspect: aspect::Simulator<3>::run()
#6 /scratch/04020/unfelipe/software/candi/aspect/aspect: void run_simulator<3>(std::__cxx11::basic_string<char, std::char_traits, std::allocator > const&, bool, bool)
#7 /scratch/04020/unfelipe/software/candi/aspect/aspect: main
Calling MPI_Abort now.
To break execution in a GDB session, execute ‘break MPI_Abort’ before running. You can also put the following into your ~/.gdbinit:
set breakpoint pending on
break MPI_Abort
set breakpoint pending auto
application called MPI_Abort(MPI_COMM_WORLD, 255) - process 0
TACC: MPI job exited with code: 255
TACC: Shutdown complete. Exiting.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
What should I do?
I think this issue should be solvable, as the simulation can actually run if I set an output time larger than about 10 Ma… So, it does run… so I guess it should be able to postprocess graphical output.
cheers,
Felipe