This paper investigates the most appropriate Application Programming Interface (API) that best accelerates the flow-based applications on the Wireless Sensor Networks (WSNs). Each WSN include many sensor nodes which have limited resources. These sensor nodes are connected together using base stations. The base stations are commonly network systems with conventional processors which are responsible for handling large amount of communicated data in flows of network packets. For this purpose, classi cation of the communicated packets is considered as the primary process in such systems. With the advent of high-performance multi-core processors, developers in the network industry have considered these processors as a striking choice for implementing a wide range of flow-based wireless sensor networking applications. The main challenge in this eld is choosing and exploiting an API which best allows multi-threading; i.e. one which maximally hides the latency of performing complex operations by threads and increases the overall efficiency of the cores. This paper assesses the efficiency of Thread, Open Multi-Processing (OpenMP) and Threading Building Blocks (TBB) libraries in multi-thread implementation of Set-Pruning and Grid-of-Tries (GOT) packet classi cation algorithms on dual-core and quad-core processors. In all cases, the speed and throughput of all parallel versions of the classifi cation algorithms are much more than the corresponding serial versions. Moreover, for parallel classi cation of a sufficiently large number of packets by both classi fication algorithms, TBB library results in higher throughput and performance than the other libraries due to its automatic scheduling and internal task stealing mechanism.