Overall, the malloc changeover has been pretty uneventful. Now that jemalloc has seen a bit wider exposure, I thought it might be useful to summarize the types of application bugs that it has been uncovering. -------- 1) Missing function prototypes for functions that return pointers. If a function prototype is missing, the compiler provides a default return type of int, but since amd64 is an LP64 architecture, this means that eight-byte pointers get cast to four-byte integers, even if they are then immediately stored in pointer variables. This pointer truncation isn't usually a problem with phkmalloc since it uses sbrk() to allocate space. However, jemalloc does not use sbrk() at all on amd64, so pretty much all malloc'ed objects are high enough in memory that more than 32 bits are required to store pointers to them. A simple way to determine whether this is a problem for an app is to #define USE_BRK in malloc.c, rebuild libc, then run your app something like: LD_PRELOAD=/usr/obj/usr/src/lib/libc/libc.so.6 startx Incidentally, this appears to be why xorg-server doesn't work on amd64. 2) Out-of-bounds writes. Lots of programs have been found to write past the end of the space they allocate. At the moment, jemalloc's redzone code is enabled, so these errors are causing messages to stderr that look like: ifconfig: (malloc) Corrupted redzone 1 byte after 0xa000150 (size 18) (0x0) In at least one case (running f2c while building the math/arpack port), these overruns would have caused actual malloc data structure corruption, had redzones not been enabled. 3) Invalid alignment assumptions. This class of bug has been less common than the first two, but it is definitely still an issue. The 'Q' and 'k' malloc configuration options provide an effective, if not very efficient, mechanism for digging into alignment issues. -------- Thanks for all the help so far with diagnosing various bugs, both in apps and in malloc. (Yes, there were some malloc bugs uncovered). We should expect to keep hitting pointer truncation bugs on amd64 for some time to come, but otherwise, I think we're pretty much out of the woods at this point. Naturally, if you need help diagnosing problems, I'll continue doing my best to help. Thanks, JasonReceived on Fri Jan 20 2006 - 06:10:20 UTC
This archive was generated by hypermail 2.4.0 : Wed May 19 2021 - 11:38:51 UTC