Manycore

Manycore processors are chips containing a large number of simpler, independent processor cores (e.g. 10's, 100's or 1000's). They are distinct from Multi-core processors in that: they are optimised from the outset for a higher degree of explicit parallelism, and for higher throughput (or lower power consumption) at the expense of latency and lower single thread performance.

Multi-core processors, by contrast are designed to run both parallel code and legacy serial code, and therefore place more emphasis on single thread performance (e.g. devoting more silicon to out of order execution, deeper pipelines or more superscalar execution units), and shared memory. These techniques devote runtime resources toward figuring out implicit parallelism in a single thread. They are used in systems where they have evolved continuously (with backward compatibility) from single core processors. They usually have a 'few' cores (e.g. 2,4,8), and may be complemented by a manycore accelerator (such as a GPU) in a heterogeneous system.

Cache coherency is an issue limiting the scaling of multicore processors. Manycore processors may bypass this with tricks such as message passing, DMA, scratchpad memory, partitioned global address space, or read-only caches. A manycore processor using a network on a chip gives software the opportunity to explicitly optimise the spatial layout of tasks (e.g. as seen in tooling developed for TrueNorth).

Manycore processors may have more in common (conceptually) with technologies originating in high performance computing such as clusters and vector processors.

GPUs may be considered a form of manycore processor having multiple shader processing units, and only being suitable for highly parallel code.

Suitable programming models

Examples

See also

External links

This article is issued from Wikipedia - version of the Thursday, May 05, 2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.