COMPUTING REDUCTION AND PREFIX SUM OPERATIONS IN MEMORY
First Claim
Patent Images
1. An apparatus, comprising:
- processing circuitry configured to compute a reduction operation on data stored in a group of memory cells by;
splitting the data into a plurality of elements;
copying each of the plurality of elements into elements that are wider than before being copied; and
performing a logical operation associated with the reduction operation on each of the copied elements.
8 Assignments
0 Petitions
Accused Products
Abstract
The present disclosure includes apparatuses and methods for computing reduction and prefix sum operations in memory. A number of embodiments include processing circuitry configured to compute a reduction operation on data stored in a group of memory cells by splitting the data into a plurality of elements, copying each of the plurality of elements into elements that are wider than before being copied, and performing a logical operation associated with the reduction operation on each of the copied elements.
185 Citations
38 Claims
-
1. An apparatus, comprising:
processing circuitry configured to compute a reduction operation on data stored in a group of memory cells by; splitting the data into a plurality of elements; copying each of the plurality of elements into elements that are wider than before being copied; and performing a logical operation associated with the reduction operation on each of the copied elements. - View Dependent Claims (2, 3, 4, 5, 6)
-
7. An apparatus, comprising:
processing circuitry configured to compute a prefix sum operation on data stored in a group of memory cells by; splitting the data into a plurality of elements; copying each of the plurality of elements into elements that are wider than before being copied; and performing a logical operation associated with the prefix sum operation on each of the copied elements. - View Dependent Claims (8, 9, 10, 11, 12)
-
13. A method of operating processing circuitry, comprising:
computing a reduction operation on data stored in a group of memory cells in an array of memory cells by; splitting the data into a plurality of elements; copying each of the plurality of elements into elements that are wider than before being copied; and performing a logical operation associated with the reduction operation on each of the copied elements. - View Dependent Claims (14, 15, 16, 17, 18)
-
19. A method of operating processing circuitry, comprising:
computing a prefix sum operation on data stored in a group of memory cells in an array of memory cells by; splitting the data into a plurality of elements; copying each of the plurality of elements into elements that are wider than before being copied; and performing a logical operation associated with the prefix sum operation on each of the copied elements. - View Dependent Claims (20, 21, 22, 23, 24, 25)
-
26. An apparatus, comprising:
processing circuitry configured to compute a reduction operation on data stored in a group of memory cells by; splitting the data into a plurality of elements; copying each of the plurality of elements into elements twice as wide and half as tall as before being copied; and performing an addition operation on each of the copied elements to combine the copied elements. - View Dependent Claims (27, 28, 29, 30, 31)
-
32. A method of operating processing circuitry, comprising:
computing a prefix sum operation on data stored in a group of memory cells by; splitting the data into even and odd indexed elements; copying the even indexed elements into elements twice as wide and half as tall as before being copied, and copying the odd indexed elements into elements twice as wide and half as tall as before being copied; performing an addition operation on the copied even indexed elements and the copied odd indexed elements to combine the copied even and odd indexed elements; shifting an output of the addition operation by one element; performing another addition operation on the shifted output and the copied even indexed elements to combine the shifted output and the copied even indexed elements; and packing the combined shifted output and copied even indexed elements into elements half as wide and twice as tall as before the another addition operation. - View Dependent Claims (33, 34, 35, 36, 37, 38)
Specification