Using pmap and gdb to find native memory leak

Question

I am debugging a native memory leak in java application. The rss is growing 1GB/day while heap showing no increase. On comparing the output of pmap over time, I see multiple anon blocks getting added either at the top of heap or between two native libraries. Can I say the memory increase between, say sssd_pac…

Accepted Answer

A very basic approach: you could try looking at who is calling mmap (and not munmap).attach to the processset breakpoint on mmap, with commands to print arguments and backtrace (maybe 5 frames) and continuesimilar thing for munmapredirect outputlet it run for a daydetachmatch mmaps with munmaps in the outputWith pmap periodically running on the side, you may be able to match newer anon regions with mmap backtraces (might need playing around with frame count).There is already this nice little article LINUX GDB: IDENTIFY MEMORY LEAKS to get you started.Note:you are looking for mmap and munmap, not malloc and freeyou will have to find out the offset of the return from mmapI have not tried the script from the article but I think it would do what the article claimsFinding mmap return instruction offset (from start of mmap):Just fire up gdb with any executable on the same host[ aquila ~ ] $ gdb -q /usr/bin/lsReading symbols from /usr/bin/ls...Reading symbols from /usr/bin/ls...(no debugging symbols found)...done.(no debugging symbols found)...done.Missing separate debuginfos, use: dnf debuginfo-install coreutils-8.27-5.fc26.x86_64(gdb) set pagination off(gdb) set breakpoint pending on(gdb) b mmapFunction "mmap" not defined.Breakpoint 1 (mmap) pending.(gdb) rStarting program: /usr/bin/lsBreakpoint 1, 0x00007ffff7df2940 in mmap64 () from /lib64/ld-linux-x86-64.so.2(gdb) disassembleDump of assembler code for function mmap64:=> 0x00007ffff7df2940 <+0>: test %rdi,%rdi 0x00007ffff7df2943 <+3>: push %r15 0x00007ffff7df2945 <+5>: mov %r9,%r15 : : 0x00007ffff7df2973 <+51>: mov $0x9,%eax : 0x00007ffff7df2982 <+66>: pop %rbx : 0x00007ffff7df298a <+74>: pop %r15 0x00007ffff7df298c <+76>: retq 0x00007ffff7df298d <+77>: nopl (%rax) : : 0x00007ffff7df29d8 <+152>: mov $0xffffffffffffffff,%rax 0x00007ffff7df29df <+159>: jmp 0x7ffff7df2982 End of assembler dump.Note the return instruction here:0x00007ffff7df298c <+76>: retqSo, on my machine, the second breakpoint would have to be set at (mmap+76).Once you determine this offset, you can verify this offset by attaching to your target process and disassembling what is at that offset. E.g. taking my current shell as my target process:[ aquila ~ ] $ echo $$9769[ aquila ~ ] $ gdb -q(gdb) attach 9769Attaching to process 9769Reading symbols from /usr/bin/bash...Reading symbols from /usr/bin/bash...(no debugging symbols found)...done.(no debugging symbols found)...done.Reading symbols from /lib64/libtinfo.so.6...Reading symbols from /lib64/libtinfo.so.6...(no debugging symbols found)...done.(no debugging symbols found)...done.Reading symbols from /lib64/libdl.so.2...(no debugging symbols found)...done.Reading symbols from /lib64/libc.so.6...(no debugging symbols found)...done.Reading symbols from /lib64/ld-linux-x86-64.so.2...(no debugging symbols found)...done.Reading symbols from /lib64/libnss_files.so.2...(no debugging symbols found)...done.0x00007fcfc67cc18a in waitpid () from /lib64/libc.so.6Missing separate debuginfos, use: dnf debuginfo-install bash-4.4.12-5.fc26.x86_64(gdb) x/i mmap+76 0x7fcfc680375c : retqI’m not very sure hbreak is required, plain old break might work as well.

Advertisement

Answer