APPARATUS AND METHOD OF IMPROVED INSERT INSTRUCTIONS
First Claim
1. An apparatus, comprising:
- instruction execution logic circuitry to execute;
a) a first instruction and a second instruction, where, both said first instruction and said second instruction insert a first group of input vector elements to one of multiple first non overlapping sections of respective first and second resultant vectors, said first group having a first bit width, each of said multiple first non overlapping sections having a same bit width as said first group;
b) a third instruction and a fourth instruction, where, both said third instruction and said fourth instruction insert a second group of input vector elements to one of multiple second non overlapping sections of respective third and fourth resultant vectors, said second group having a second bit width that is larger than said first bit width, each of said multiple second non overlapping sections having a same bit width as said second group;
masking layer circuitry to mask said first and third instructions at a first resultant vector granularity, and, mask said second and fourth instructions at a second resultant vector granularity.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus is described having instruction execution logic circuitry to execute first, second, third and fourth instruction. Both the first instruction and the second instruction insert a first group of input vector elements to one of multiple first non overlapping sections of respective first and second resultant vectors. The first group has a first bit width. Each of the multiple first non overlapping sections have a same bit width as the first group. Both the third instruction and the fourth instruction insert a second group of input vector elements to one of multiple second non overlapping sections of respective third and fourth resultant vectors. The second group has a second bit width that is larger than said first bit width. Each of the multiple second non overlapping sections have a same bit width as the second group. The apparatus also includes masking layer circuitry to mask the first and third instructions at a first resultant vector granularity, and, mask the second and fourth instructions at a second resultant vector granularity.
22 Citations
20 Claims
-
1. An apparatus, comprising:
-
instruction execution logic circuitry to execute; a) a first instruction and a second instruction, where, both said first instruction and said second instruction insert a first group of input vector elements to one of multiple first non overlapping sections of respective first and second resultant vectors, said first group having a first bit width, each of said multiple first non overlapping sections having a same bit width as said first group; b) a third instruction and a fourth instruction, where, both said third instruction and said fourth instruction insert a second group of input vector elements to one of multiple second non overlapping sections of respective third and fourth resultant vectors, said second group having a second bit width that is larger than said first bit width, each of said multiple second non overlapping sections having a same bit width as said second group; masking layer circuitry to mask said first and third instructions at a first resultant vector granularity, and, mask said second and fourth instructions at a second resultant vector granularity. - View Dependent Claims (2, 3, 4, 5, 6, 7)
-
-
8. A method, comprising:
-
executing a first instruction including inserting a first group of input vector elements to one of multiple first non overlapping sections of a first resultant vector, said first group having a first bit width, each of said multiple first non overlapping sections having a same bit width as said first group, masking said first group at a first granularity; executing a second instruction including inserting a second group of input vector elements to one of multiple second non overlapping sections of a second resultant vector, said second group having a second bit width, each of said multiple second non overlapping sections having a same bit width as said second group, masking said second group a second granularity, said first granularity being finer than said second granularity; executing a third instruction including inserting a third group of input vector elements to one of multiple third non overlapping sections of a third resultant vector, said third group having said first bit width, each of said multiple third non overlapping sections having a same bit width as said first group, masking said third group at said second granularity; and
,executing a fourth instruction including inserting a fourth group of input vector elements to one of multiple fourth non overlapping sections of a fourth resultant vector, said fourth group having said second bit width, each of said multiple fourth non overlapping sections having a same bit width as said second group, masking said fourth group at said first granularity. - View Dependent Claims (9, 10, 11, 12, 13, 14, 15)
-
-
16. An apparatus, comprising:
-
instruction execution logic circuitry to execute; a) a first instruction and a second instruction, where, both said first instruction and said second instruction insert a first group of input vector elements to one of multiple first non overlapping sections of respective first and second resultant vectors in accordance with first and second immediate operands respectively, said first group having a first bit width, each of said multiple first non overlapping sections having a same bit width as said first group; b) a third instruction and a fourth instruction, where, both said third instruction and said fourth instruction insert a second group of input vector elements to one of multiple second non overlapping sections of respective third and fourth resultant vectors in according with third and fourth immediate operands respectively, said second group having a second bit width that is larger than said first bit width, each of said multiple second non overlapping sections having a same bit width as said second group; masking layer circuitry to mask said first and third instructions at a first resultant vector granularity, and, mask said second and fourth instructions at a second resultant vector granularity. - View Dependent Claims (17, 18, 19, 20)
-
Specification