What exactly happens in the JVM when invoking an object’s instance method?

Question

I think I have finally found out how to word, what is giving me so much trouble in understanding: how the virtual machine can access a classes methods and use it only on a given instance (object) with the catch that the virtual machine is only being given the reference/pointer variable. This was compounded by…

Accepted Answer

Ordinary instance method invocations get compiled to invokevirtual instructions.This has been described in JVMS, §3.7. Invoking Methods:The normal method invocation for a instance method dispatches on the run-time type of the object. (They are virtual, in C++ terms.) Such an invocation is implemented using the invokevirtual instruction, which takes as its argument an index to a run-time constant pool entry giving the internal form of the binary name of the class type of the object, the name of the method to invoke, and that method&#8217;s descriptor (§4.3.3). To invoke the addTwo method, defined earlier as an instance method, we might write:int add12and13() {    return addTwo(12, 13);}This compiles to:Method int add12and13()0   aload_0             // Push local variable 0 (this)1   bipush 12           // Push int constant 123   bipush 13           // Push int constant 135   invokevirtual #4    // Method Example.addtwo(II)I8   ireturn             // Return int on top of operand stack;                        // it is the int result of addTwo()The invocation is set up by first pushing a reference to the current instance, this, on to the operand stack. The method invocation&#8217;s arguments, int values 12 and 13, are then pushed. When the frame for the addTwo method is created, the arguments passed to the method become the initial values of the new frame&#8217;s local variables. That is, the reference for this and the two arguments, pushed onto the operand stack by the invoker, will become the initial values of local variables 0, 1, and 2 of the invoked method.It’s up to the particular JVM implementation, how to perform the invocation at runtime, but using a vtable is very common. This basically matches the graphic in your question. The reference to the receiver object, which will become the this reference for the invoked method, is used to retrieve a method table.In the HotSpot JVM, the metadata structure is called Klass (actually a common name, even across different implementations). See “Object header layout” on the OpenJDK Wiki:An object header consists of a native-sized mark word, a klass word, a 32-bit length word (if the object is an array), a 32-bit gap (if required by alignment rules), and then zero or more instance fields, array elements, or metadata fields. (Interesting Trivia: Klass metaobjects contain a C++ vtable immediately after the klass word.)When resolving a symbolic reference to a method, its corresponding index in the table will be identified and remembered for subsequent invocations, as it never changes. Then, the entry of the actual object’s class can be used for the invocation. Subclasses will have the entries of the superclass, new methods appended to the end, with the entries of overridden methods replaced.This is the simple, unoptimized scenario. Most runtime optimizations work better when methods are inlined, to have the context of caller and callee in one piece of code to transform. Therefore, the HotSpot JVM will attempt inlining even for invokevirtual instructions to potentially overridable methods. As the wiki says:Virtual (and interface) invocations are often demoted to &#8220;special&#8221; invocations, if the class hierarchy permits it. A dependency is registered in case further class loading spoils things.Virtual (and interface) invocations with a lopsided type profile are compiled with an optimistic check in favor of the historically common type (or two types).Depending on the profile, a failure of the optimistic check will either deoptimize or run through a (slow) vtable/itable call.On the fast path of an optimistically typed call, inlining is common. The best case is a de facto monomorphic call which is inlined. Such calls, if back-to-back, will perform the receiver type check only once.This aggressive or optimistic inlining will sometime require Deoptimization but will usually yield an overall higher performance.

Advertisement

Answer