| 
BUG: PowerPC FPSCR Not Saved on Thread Context Switch
ID: Q158947
 
 | 
The information in this article applies to:
- 
Microsoft Win32 Software Development Kit (SDK) for Windows NT, versions  3.51, 4.0
SYMPTOMS
Multithreaded applications compiled for Windows NT on PowerPC processors
may show incorrect or unexpected floating point exception handling
behavior. When two or more threads in a process set their floating point
masks to different values, all of the threads get the same mask value as
the thread that set its mask last.
When this problem is encountered, the exact behavior of the application
will depend on which floating point exceptions are masked and unmasked and
how the application handles exceptions that do occur. Following is a
possible scenario:
- One thread unmasks a floating point exception and, some time
   after that, another unmasks a different exception. Then, during the
   course of computation the first thread generates an exception
   that it has unmasked but its exception handler does not get
   called. Or, it generates the exception that the second thread
   unmasked and its handler does get called but with an exception
   that it wasn't expecting.
This problem occurs only on PowerPC processors; Windows NT on other
processors does not exhibit this problem.
CAUSE
The PowerPC's floating point status and control register (FPSCR) is not
being saved during thread context switches; it is being treated as a per-
process resource rather than a per-thread resource.
STATUS
Microsoft has confirmed this to be a problem in Microsoft Windows NT
versions 3.51 and 4.0. Microsoft is researching this problem and will post
new information here as it becomes available.
MORE INFORMATION
The only time this behavior becomes a problem is when a multithreaded
application uses different floating point exception masks for each thread.
If the application is single-threaded or if all threads use the same
floating point exception mask, the problem does not occur.
The code below demonstrates the problem. The first thread unmasks floating
point divide by zero exceptions and then spawns another thread and waits
for it to exit. The spawned thread unmasks floating point underflow
exceptions and then exits. When the first thread returns from waiting for
the second, the FPSCR is set to floating point underflow exceptions being
unmasked but floating point divide by zero exceptions being masked. Thus,
the FPSCR value of the first thread's is now set to that of the second
thread. When the first thread performs the computation, the divide by zero
error is not sent to the exception handler.
Sample Code
To see the problem in a debugger as it happens, build this sample with
Microsoft Visual C++, and then set breakpoints on the lines with the
"BP x here" comments. As you step through the code, examine the value of
the FPSCR in the registers window of the debugger:
   #include <windows.h>
   #include <stdio.h>
   #include <float.h>
   
   DWORD WINAPI Thread (LPVOID lpvParam);
   
   void main (int argc, char ** argv)
   {
      volatile float x1 = 5.0F,
                     x2 = 0.0F;
      DWORD          tid;
      HANDLE         hThread;
   
      // Unmask floating point divide by zero exceptions for
      // this thread.
      _controlfp ((unsigned int)~_EM_ZERODIVIDE, _MCW_EM);   // BP 1 here
   
      // Create another thread
      hThread = CreateThread(NULL, 0, Thread, NULL, 0, &tid);
   
      // This will force a context switch.
      WaitForSingleObject (hThread, INFINITE);       // BP 2 here
      CloseHandle (hThread);
   
      // WHEN GET TO HERE, this thread's FPSCR should still
      // have the FP divide by zero exception unmasked.
      __try
      {
   
         x1 /= x2;   // cause FP divide by zero         // BP 3 here
   
      }
      __except (GetExceptionCode() == EXCEPTION_FLT_DIVIDE_BY_ZERO ?
                EXCEPTION_EXECUTE_HANDLER :
                EXCEPTION_CONTINUE_SEARCH)
      {
         // BP 4 on next line
         printf ("Trapped floating point divide by zero exception\n");
         _clearfp();
      }
      // BP 5 on next line
      printf ("past floating point divide by zero exception\n");
   }
   
   // This is a dummy thread that gets the default FPSCR settings
   // when created.
   DWORD WINAPI Thread (LPVOID lpvParam)
   {
      // Unmask floating point underflow exceptions for
      // this thread.
      _controlfp ((unsigned int)~_EM_UNDERFLOW, _MCW_EM);    // BP 6 here
      return 0;
   }
 
When this application works correctly, it should produce this output:
   Trapped floating point divide by zero exception
   past floating point divide by zero exception 
The incorrect output is:
   past floating point divide by zero exception 
Additional query words: 
3.51 4.00 kbdss fp floating point status control signal interrupt handler 
Keywords          : kbFloatPoint kbKernBase kbThread kbDSupport kbGrpKernBase 
Version           : NT:3.51,4.0
Platform          : NT 
Issue type        : kbbug