I added the following statement to a __global__ function:
__shared__ float temp[BLOCK_SIZE + MASK_WIDTH - 1];
The compiler reports three errors on this line: