標籤:c 內聯彙編 函數調用 局部變數 asm
函數調用處理過程
The C calling convention in 16-bit programs is as follows. In the following description, the words caller and callee are used to denote the function doing the calling and the function which gets called.
- The caller pushes the function‘s parameters on the stack, one after another, in reverse order (right to left, so that the first argument specified to the function is pushed last).
- The caller then executes a
CALL instruction to pass control to the callee. This CALL is either near or far depending on the memory model.
- The callee receives control, and typically (although this is not actually necessary, in functions which do not need to access their parameters) starts by saving the value of
SP in BP so as to be able to use BP as a base pointer to find its parameters on the stack. However, the caller was probably doing this too, so part of the calling convention states that BP must be preserved by any C function. Hence the callee, if it is going to set up BP as a frame pointer, must push the previous value first.
- The callee may then access its parameters relative to
BP. The word at [BP] holds the previous value of BP as it was pushed; the next word, at [BP+2], holds the offset part of the return address, pushed implicitly byCALL. In a small-model (near) function, the parameters start after that, at [BP+4]; in a large-model (far) function, the segment part of the return address lives at [BP+4], and the parameters begin at [BP+6]. The leftmost parameter of the function, since it was pushed last, is accessible at this offset from BP; the others follow, at successively greater offsets. Thus, in a function such as printf which takes a variable number of parameters, the pushing of the parameters in reverse order means that the function knows where to find its first parameter, which tells it the number and type of the remaining ones.
- The callee may also wish to decrease
SP further, so as to allocate space on the stack for local variables, which will then be accessible at negative offsets from BP.
- The callee, if it wishes to return a value to the caller, should leave the value in
AL, AX or DX:AX depending on the size of the value. Floating-point results are sometimes (depending on the compiler) returned in ST0.
- Once the callee has finished processing, it restores
SP from BP if it had allocated local stack space, then pops the previous value of BP, and returns via RETN or RETF depending on memory model.
- When the caller regains control from the callee, the function parameters are still on the stack, so it typically adds an immediate constant to
SP to remove them (instead of executing a number of slow POPinstructions). Thus, if a function is accidentally called with the wrong number of parameters due to a prototype mismatch, the stack will still be returned to a sensible state since the caller, which knows how many parameters it pushed, does the removing.
引用自NASM手冊調用過程樣本及測試案例
/* 以三個參數為例子,AT&T32彙編,棧向低地址增長fun(p0, p1, p2);1.參數入棧,right->leftgcc的做法: 其中p0 p1 p2姑且認為是立即數movl $p2 8(%esp)movl $p1 4(%esp)movl $p0 (%esp) |_______|ebp->|_______| |_______| |_______| |_______| |__p2___| |__p1___| |__p0___|<-esp |_______| |_______| |_______|2.call fun |_______|ebp->|_______| |_______| |_______| |_______| |__p2___| |__p1___| |__p0___| |__ret__|<-esp |_______| |_______|3.跳轉到fun代碼push %ebpmovl %esp, %ebp |_______| |_______| |_______| |_______| |__p2___| |__p1___| |__p0___| |__ret__|ebp->|__ebp__|<-esp |_______| |_______| 其中儲存的ebp是第2步中的ebp,即之前的ebp,不是圖左面的ebp如果棧向低地址增長那麼old ebp = [ebp]ret = [ebp + 4]p0 = [ebp + 8]p1 = [ebp + 12]p2 = [ebp + 16] 4.decrease ESP further, so as to allocate space on the stack for local variables,which will then be accessible at negative offsets from EBP. subl $24, %esp根據具體需要分配棧空間,這裡樣本24/4=6個4B局部變數 |_______| |_______| |_______| |_______| |__p2___| |__p1___| |__p0___| |__ret__|ebp->|__ebp__| |_______| |_______| |_______|<= esp + 12 = ebp - 12 |_______| |_______| |_______|<-esp |_______| |_______| 5.返回5.1 調整棧指標方式1:addl$24, %esp方式2:movl %ebp, %esp |_______| |_______| |_______| |_______| |__p2___| |__p1___| |__p0___| |__ret__|ebp->|__ebp__|<-esp |_______| |_______| |_______| |_______| |_______| |_______| |_______| |_______| 5.2彈出舊frame指標=銷毀當前棧幀popl%ebp |_______|ebp->|_______| |_______| |_______| |_______| |__p2___| |__p1___| |__p0___| |__ret__|<-esp |__ebp__| |_______| |_______| |_______| |_______| |_______|5.3 返回調主ret |_______|ebp->|_______| |_______| |_______| |_______| |__p2___| |__p1___| |__p0___|<-esp |__ret__| |__ebp__| |_______| |_______| |_______|上面說要調整esp,把參數移出掉。gcc沒有做這一步。esp就保留在這裡當下面再次調用函數時,舊值就被覆蓋掉。參看第一步!5.1和5.2步,可以使用leave指令代替 *///驗證#include <stdio.h>#include <stdlib.h> void fun(int a, int b){int local;unsigned int i, j;long ret;local = 12;a = 10;//printf("%d\n", a);#if 1asm volatile ("movl %%ebp, %%eax\n\t""movl %%esp, %%ebx\n\t":"=a"(i), "=b"(j):);printf("ebp = %p; esp = %p\n", i, j);printf("local auto var : %d\n", i - j); //局部變數佔用空間asm volatile ("movl 12(%%ebp), %%eax\n\t"//參數b"movl 8(%%ebp), %%ebx\n\t" //參數a"movl 4(%%ebp), %%ecx\n\t" //ret:"=a"(i), "=b"(j), "=c"(ret):);printf("p2 = %x; p1 = %x; ret = %p\n", i, j, ret);#endif}int main(int argc, char *argv[]){fun(1, 2);exit(0);}總結
1-如果自行編寫彙編,那麼就要自行處理這些工作,以及對某些寄存器的壓棧出棧的保護措施
2-第5.2步,當ebp彈出後,一個棧幀就不存在了,即函數環境已經撤銷,因此如果fun內return一個局部變數的引用,也就無意義了。這是最常見的問題!!!
3-局部變數空間的分配,換言之,第4步,對esp的調整,其大小要根據編譯時間符號表而定!
C函數調用原理