Instruction execution that broadcasts and masks data values at different levels of granularity
First Claim
1. An apparatus, comprising:
- an execution unit to execute a first instruction and a second instruction, said execution unit including the following;
input register space to store a first data structure to be replicated when executing said first instruction and to store a second data structure to be replicated when executing said second instruction, said first and second data structures both being packed data structures, data values of said first packed data structure being twice as large as data values of said second packed data structure;
replication logic circuitry to replicate said first data structure when executing said first instruction to create a first replication data structure, and, to replicate said second data structure when executing said second instruction to create a second replication data structure; and
masking logic circuitry to mask said first replication data structure at a first granularity and mask said second replication data structure at a second granularity, said second granularity being twice as fine as said first granularity.
Abstract
An apparatus is described that includes an execution unit to execute a first instruction and a second instruction. The execution unit includes input register space to store a first data structure to be replicated when executing the first instruction and to store a second data structure to be replicated when executing the second instruction. The first and second data structures are both packed data structures. Data values of the first packed data structure are twice as large as data values of the second packed data structure. The execution unit also includes replication logic circuitry to replicate the first data structure when executing the first instruction to create a first replication data structure, and to replicate the second data structure when executing the second instruction to create a second replication data structure. The execution unit also includes masking logic circuitry to mask the first replication data structure at a first granularity and mask the second replication data structure at a second granularity. The second granularity is twice as fine as the first granularity.
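The replicate-then-mask behavior summarized in the abstract can be illustrated with a minimal Python sketch. It is not the patented circuitry. The 512-bit result width, the 64-bit and 32-bit data value sizes, the two-value packed inputs, the zeroing of masked-off lanes, and the names `RESULT_BITS`, `replicate`, and `apply_mask` are all assumptions chosen for illustration; the text above fixes only the 2:1 size ratio and the 2:1 mask granularity ratio.

```python
RESULT_BITS = 512  # assumed result width; not stated in the abstract


def replicate(packed, elem_bits):
    """Repeat a packed structure (list of data values) until the result is full."""
    lanes = RESULT_BITS // elem_bits
    if lanes % len(packed) != 0:
        raise ValueError("packed structure must divide the result evenly")
    if any(not 0 <= v < (1 << elem_bits) for v in packed):
        raise ValueError("data value does not fit in elem_bits")
    return list(packed) * (lanes // len(packed))


def apply_mask(lanes, mask):
    """Keep lane i when mask bit i is 1; zero it otherwise (zeroing is assumed)."""
    return [v if (mask >> i) & 1 else 0 for i, v in enumerate(lanes)]


# "First instruction": larger (assumed 64-bit) values, one mask bit per 64 bits.
first = apply_mask(replicate([0xA, 0xB], 64), 0b00001111)

# "Second instruction": values half as large (assumed 32-bit), so the same
# result width needs twice as many mask bits, i.e. a mask twice as fine.
second = apply_mask(replicate([0xA, 0xB], 32), 0b0000000011111111)
```

Under these assumptions the first result has 8 maskable lanes and the second has 16, which is the sense in which the second granularity is twice as fine as the first.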
20 Claims
1. An apparatus, comprising:
- an execution unit to execute a first instruction and a second instruction, said execution unit including the following;
input register space to store a first data structure to be replicated when executing said first instruction and to store a second data structure to be replicated when executing said second instruction, said first and second data structures both being packed data structures, data values of said first packed data structure being twice as large as data values of said second packed data structure;
replication logic circuitry to replicate said first data structure when executing said first instruction to create a first replication data structure, and, to replicate said second data structure when executing said second instruction to create a second replication data structure; and
masking logic circuitry to mask said first replication data structure at a first granularity and mask said second replication data structure at a second granularity, said second granularity being twice as fine as said first granularity.
Dependent claims: 2, 3, 4, 5, 6, 7
8. A method, comprising:
- fetching a first data structure for a first instruction, said first data structure being a first packed data structure;
- executing said first instruction by replicating said first data structure within an execution unit to create a first replication data structure and masking said first replication data structure;
- fetching a second data structure for a second instruction, said second data structure being a second packed data structure, data values of said first packed data structure being twice as large as data values of said second packed data structure; and
- executing said second instruction by replicating said second data structure within said execution unit to create a second replication data structure and masking said second replication data structure at a granularity that is twice as fine as said masking of said first replication data structure.
Dependent claims: 9, 10, 11, 12, 13, 14
15. An apparatus, comprising:
- an execution unit to execute a first instruction and a second instruction, said execution unit including the following;
input register space to store a respective data structure to be replicated for each of a plurality of instructions to be executed by said execution unit, said respective data structure for each of said plurality of instructions including;
a first unpacked data structure for a first instruction;
a second unpacked data structure for a second instruction where the second unpacked data structure is twice as large as said first unpacked data structure,
a third packed data structure for a third instruction whose data values are the same size as said first unpacked data structure,
a fourth packed data structure for a fourth instruction whose data values are the same size as said second unpacked data structure;
replication logic circuitry to;
replicate said first unpacked data structure when executing said first instruction to create a first replication data structure,
replicate said second unpacked data structure when executing said second instruction to create a second replication data structure,
replicate said third packed data structure when executing said third instruction to create a third replication data structure,
replicate said fourth packed data structure when executing said fourth instruction to create a fourth replication data structure; and
masking logic circuitry to mask said first replication data structure and said third replication data structure at a first granularity and mask said second replication data structure and said fourth replication data structure at a second granularity, said first granularity being twice as fine as said second granularity.
Dependent claims: 16, 17, 18, 19, 20
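Claim 15 pairs four instruction forms with two mask granularities, which may be easier to follow as a small table. The Python sketch below is illustrative only and assumes a 512-bit result, 32-bit and 64-bit sizes, two-value packed inputs, and zeroing of masked-off lanes. `BroadcastForm`, `FORMS`, `mask_width`, and `execute` are hypothetical names, not terms from the claim.

```python
from dataclasses import dataclass

VECTOR_BITS = 512  # assumed result width; the claim does not state one


@dataclass(frozen=True)
class BroadcastForm:
    name: str
    elem_bits: int     # size of each data value (assumed 32 or 64)
    packed_count: int  # 1 = unpacked (single value), >1 = packed


FORMS = [
    BroadcastForm("first", 32, 1),   # unpacked
    BroadcastForm("second", 64, 1),  # unpacked, twice as large as the first
    BroadcastForm("third", 32, 2),   # packed, values sized like the first
    BroadcastForm("fourth", 64, 2),  # packed, values sized like the second
]


def mask_width(form):
    """Number of mask bits: one per data value in the result."""
    return VECTOR_BITS // form.elem_bits


def execute(form, source, mask):
    """Replicate `source` across the result, then zero lanes whose mask bit is 0."""
    if len(source) != form.packed_count:
        raise ValueError("source does not match the form's packed_count")
    lanes = VECTOR_BITS // form.elem_bits
    replicated = list(source) * (lanes // form.packed_count)
    return [v if (mask >> i) & 1 else 0 for i, v in enumerate(replicated)]


widths = {form.name: mask_width(form) for form in FORMS}
```

With these assumed sizes, the first and third forms share a 16-bit mask and the second and fourth share an 8-bit mask, so the first granularity is twice as fine as the second, matching the pairing recited in the claim.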
Specification