Instruction execution that broadcasts and masks data values at different levels of granularity
First Claim
1. An apparatus, comprising:
- a decoder to decode a first instruction into a decoded first instruction, and a second instruction into a decoded second instruction; and
an execution unit to execute the decoded first instruction and the decoded second instruction, said execution unit including the following;
input register space to store a first data structure to be replicated when executing said decoded first instruction and to store a second data structure to be replicated when executing said decoded second instruction, said first and second data structures both being packed data structures, data values of said first packed data structure being twice as large as data values of said second packed data structure,
replication logic circuitry to replicate said first data structure when executing said decoded first instruction to create a first replication data structure, and to replicate said second data structure when executing said decoded second instruction to create a second replication data structure, and
masking logic circuitry to mask said first replication data structure at a first granularity and mask said second replication data structure at a second granularity, said second granularity being twice as fine as said first granularity.
Abstract
An apparatus is described that includes an execution unit to execute a first instruction and a second instruction. The execution unit includes input register space to store a first data structure to be replicated when executing the first instruction and to store a second data structure to be replicated when executing the second instruction. The first and second data structures are both packed data structures. Data values of the first packed data structure are twice as large as data values of the second packed data structure. The execution unit also includes replication logic circuitry to replicate the first data structure when executing the first instruction to create a first replication data structure, and to replicate the second data structure when executing the second instruction to create a second replication data structure. The execution unit also includes masking logic circuitry to mask the first replication data structure at a first granularity and mask the second replication data structure at a second granularity. The second granularity is twice as fine as the first granularity.
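The replicate-then-mask behavior summarized in the abstract can be illustrated with a minimal software sketch. It is not the claimed circuitry: the 512-bit result width, the 64-bit and 32-bit element sizes, the zeroing of masked-out elements, and the name `broadcast_and_mask` are all assumptions chosen for illustration. The abstract only requires that the first data values be twice as large as the second, and that the second mask granularity be twice as fine as the first.

```python
def broadcast_and_mask(packed, element_bits, result_bits, mask):
    """Replicate `packed` across `result_bits`, then mask one element per mask bit.

    Hypothetical helper: zeroing masked-out elements is an assumption,
    since the abstract does not say what a masked element becomes.
    """
    lanes = result_bits // element_bits            # number of maskable elements
    if lanes % len(packed) != 0:
        raise ValueError("packed structure must tile the result evenly")
    replicated = packed * (lanes // len(packed))   # replication step
    # Masking step: mask bit i governs element i of the replicated structure.
    return [v if (mask >> i) & 1 else 0 for i, v in enumerate(replicated)]


RESULT_BITS = 512  # assumed result width

# First instruction: packed pair of 64-bit values, masked at 64-bit granularity
# (8 mask bits cover the result).
first = broadcast_and_mask([0xA, 0xB], 64, RESULT_BITS, mask=0b10110101)

# Second instruction: packed pair of 32-bit values, masked at 32-bit granularity
# (16 mask bits cover the same result width, so the mask is twice as fine).
second = broadcast_and_mask([0xC, 0xD], 32, RESULT_BITS, mask=0xFF0F)
```

With the same assumed result width, the second call needs twice as many mask bits as the first, which is the sense in which its granularity is "twice as fine".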
19 Citations
20 Claims
1. An apparatus, comprising:
a decoder to decode a first instruction into a decoded first instruction, and a second instruction into a decoded second instruction; and an execution unit to execute the decoded first instruction and the decoded second instruction, said execution unit including the following; input register space to store a first data structure to be replicated when executing said decoded first instruction and to store a second data structure to be replicated when executing said decoded second instruction, said first and second data structures both being packed data structures, data values of said first packed data structure being twice as large as data values of said second packed data structure, replication logic circuitry to replicate said first data structure when executing said decoded first instruction to create a first replication data structure, and to replicate said second data structure when executing said decoded second instruction to create a second replication data structure, and masking logic circuitry to mask said first replication data structure at a first granularity and mask said second replication data structure at a second granularity, said second granularity being twice as fine as said first granularity. - View Dependent Claims (2, 3, 4, 5, 6, 7)
8. A method, comprising:
decoding a first instruction into a decoded first instruction, and a second instruction into a decoded second instruction with a decoder; fetching a first data structure for the first instruction, said first data structure being a first packed data structure; executing said decoded first instruction by replicating said first data structure within an execution unit to create a first replication data structure and masking said first replication data structure; fetching a second data structure for the second instruction, said second data structure being a second packed data structure, data values of said first packed data structure being twice as large as data values of said second packed data structure; and executing said decoded second instruction by replicating said second data structure within said execution unit to create a second replication data structure and masking said second replication data structure at a granularity that is twice as fine as said masking of said first replication data structure. - View Dependent Claims (9, 10, 11, 12, 13, 14)
15. An apparatus, comprising:
a decoder to decode a first instruction into a decoded first instruction, a second instruction into a decoded second instruction, a third instruction into a decoded third instruction, and a fourth instruction into a decoded fourth instruction; and an execution unit to execute the decoded first instruction, the decoded second instruction, the decoded third instruction, and the decoded fourth instruction, said execution unit including the following; input register space to store a respective data structure to be replicated for each of a plurality of decoded instructions to be executed by said execution unit, said respective data structure for each of said plurality of decoded instructions including;
a first unpacked data structure for the first instruction, a second unpacked data structure for the second instruction where the second unpacked data structure is twice as large as said first unpacked data structure, a third packed data structure for the third instruction whose data values are the same size as said first unpacked data structure, a fourth packed data structure for the fourth instruction whose data values are the same size as said second unpacked data structure,
replication logic circuitry to;
replicate said first unpacked data structure when executing said decoded first instruction to create a first replication data structure, replicate said second unpacked data structure when executing said decoded second instruction to create a second replication data structure, replicate said third packed data structure when executing said decoded third instruction to create a third replication data structure, replicate said fourth packed data structure when executing said decoded fourth instruction to create a fourth replication data structure, and
masking logic circuitry to mask said first replication data structure and said third replication data structure at a first granularity and mask said second replication data structure and said fourth replication data structure at a second granularity, said first granularity being twice as fine as said second granularity. - View Dependent Claims (16, 17, 18, 19, 20)
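The four-instruction arrangement of claim 15 can be pictured with a small illustrative model. Everything concrete here is an assumption rather than claim language: the 32-bit and 64-bit sizes, the 512-bit result width, packed structures of two values, zeroing of masked-out elements, and the names `BroadcastOp`, `OPS`, and `execute`. The claim itself fixes only the size relationships and which replication results share a mask granularity.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BroadcastOp:
    name: str          # hypothetical label, not a real instruction mnemonic
    element_bits: int  # size of each data value; also the assumed mask granularity
    packed_count: int  # 1 = unpacked (single value), >1 = packed structure


RESULT_BITS = 512  # assumed result width

OPS = {
    "first": BroadcastOp("first", 32, 1),    # unpacked, smaller value
    "second": BroadcastOp("second", 64, 1),  # unpacked, twice as large
    "third": BroadcastOp("third", 32, 2),    # packed, values sized like "first"
    "fourth": BroadcastOp("fourth", 64, 2),  # packed, values sized like "second"
}


def execute(op, source, mask):
    """Replicate `source` across the result, then zero elements whose mask bit is 0."""
    if len(source) != op.packed_count:
        raise ValueError("source does not match the operation's data structure")
    lanes = RESULT_BITS // op.element_bits
    replicated = list(source) * (lanes // op.packed_count)
    return [v if (mask >> i) & 1 else 0 for i, v in enumerate(replicated)]


# Number of mask bits each operation consumes for the assumed result width.
mask_bits = {name: RESULT_BITS // op.element_bits for name, op in OPS.items()}

scalar32 = execute(OPS["first"], [7], mask=0b0101)     # unpacked broadcast
packed64 = execute(OPS["fourth"], [1, 2], mask=0b110)  # packed broadcast
```

In this model the first and third operations each consume 16 mask bits and the second and fourth each consume 8, mirroring the claim's statement that the first granularity is twice as fine as the second.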
Specification