Profiling the C++ compilation process

C++CompilationProfiling

C++ Problem Overview


I tend to write rather large templated header-only C++ libraries and my users commonly complain about compilation times. After thinking about the matter, it occurred to me that I have no idea where the time is going. Is there some simple way to profile the C++ compilation process with common compilers, such as g++, icc, and xlC? For instance, is it possible to get an idea of how much time is spent within each of the phases of C++ compilation?

C++ Solutions


Solution 1 - C++

For GCC there are debugging options to find how much time is spent within each of the phases of C++ compilation?

> -Q Makes the compiler print out each function name as it is compiled, and print some statistics about each pass when it finishes.

>-ftime-report Makes the compiler print some statistics about the time consumed by each pass when it finishes.

Passes are described in GCCINT 9: Passes and Files of the Compiler.

You can post output of g++ compilation of single source file with -v -ftime-report here to discuss it. There could be some help on the GCC mailing list.


For compilers other than GCC (or GCC more ancient than 3.3.6) see the other options in this thread.

Solution 2 - C++

Clang 9 (and newer) has a -ftime-trace flag, which makes it output a profiling report as JSON (in addition to an object file).

You can import this file into a profiler that comes with Chrome (chrome://tracing) to get a nice visualisation:

pic

The bars correspond to headers that had to be parsed, and for each header, specific classes (and probably other constructs) that had to be parsed. It also reports time spent on instantiating specific templates.

Solution 3 - C++

There's a tool from the Boost project, which could be useful for pretty much any compiler and build system.

The tool requires source code instrumentation with TEMPLATE_PROFILE_ENTER() and TEMPLATE_PROFILE_EXIT() macro calls. These macros then generate specific diagnostics (warnings) at compile-time, which are timed and collected along with instantiation callstacks (which consequently allow building and visualizing callgraphs) by a script. Not bad, IMO.

I didn't use it yet though.

Solution 4 - C++

I have not yet tried it, but templight looks VERY promising: https://github.com/mikael-s-persson/templight

Solution 5 - C++

You can separate them out to some extent (I'm assuming make)

  • add a build rule that only preprocesses files (using the -E switch), and a .PHONY target that depends on the preprocessor output files just like the normal binary target depends on .o files. Measure how long it takes to build this target
  • add a 'PHONY target that depends on all the .o files but doesn't link them. Measure how long it takes to build this target (from clean)
  • measure how long it takes to do a clean build of the usual binary

Now you have some idea how long it takes to pre-process, compile, and link. You can also compare optimized and non-optimized (-O0) versions of the second and third target, to see how long is spent in the optimizer.

Solution 6 - C++

You might be able to get some traction with some variant on strace -e trace=process -f -r -ttt -T, at least for compilers like g++ that are broken up into many processes.

Solution 7 - C++

Others have already suggested the -ftime-report command line flag for GCC, which makes the compiler print some statistics about the time consumed by each compilation phase. The drawback is that it only shows summary for one unit.

I've written a Python script, which allows to print total summary on all units, by each compilation phase, given the project build log file. It also allows sorting by different phases. And it also allows to compare two log files (e.g., if you're trying to understand the impact of your changes).

Solution 8 - C++

Externis is a GCC plugin that will generate trace files very similar to clang's -ftime-trace:

enter image description here

Disclaimer: I'm the author of this plugin.

Attributions

All content for this solution is sourced from the original question on Stackoverflow.

The content on this page is licensed under the Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license.

Content TypeOriginal AuthorOriginal Content on Stackoverflow
QuestionJack PoulsonView Question on Stackoverflow
Solution 1 - C++osgxView Answer on Stackoverflow
Solution 2 - C++HolyBlackCatView Answer on Stackoverflow
Solution 3 - C++ulidtkoView Answer on Stackoverflow
Solution 4 - C++Paul BrannanView Answer on Stackoverflow
Solution 5 - C++UselessView Answer on Stackoverflow
Solution 6 - C++Phil MillerView Answer on Stackoverflow
Solution 7 - C++Alex CheView Answer on Stackoverflow
Solution 8 - C++unddochView Answer on Stackoverflow