MULTIPLYING AND ADDING MATRICES
First Claim
Patent Images
1. A method comprising:
- decoding by a decoder in a processor device, a single instruction specifying an m-by-m matrix operation for a set of vectors, wherein each vector represents an m-by-m matrix of data elements and m is greater than one;
issuing the single instruction for execution by an execution unit in the processor device; and
responsive to the execution of the single instruction, generating a resultant vector, wherein the resultant vector represents an m-by-m matrix of data elements.
1 Assignment
0 Petitions
Accused Products
Abstract
An apparatus and method are described for multiplying and adding matrices. For example, one embodiment of a method comprises decoding by a decoder in a processor device, a single instruction specifying an m-by-m matrix operation for a set of vectors, wherein each vector represents an m-by-m matrix of data elements and m is greater than one; issuing the single instruction for execution by an execution unit in the processor device; and responsive to the execution of the single instruction, generating a resultant vector, wherein the resultant vector represents an m-by-m matrix of data elements.
-
Citations
18 Claims
-
1. A method comprising:
-
decoding by a decoder in a processor device, a single instruction specifying an m-by-m matrix operation for a set of vectors, wherein each vector represents an m-by-m matrix of data elements and m is greater than one; issuing the single instruction for execution by an execution unit in the processor device; and responsive to the execution of the single instruction, generating a resultant vector, wherein the resultant vector represents an m-by-m matrix of data elements. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8)
-
-
9. A processor device comprising:
-
a storage location configured to store a first set of data elements, a second set of data elements, and a third set of data elements, wherein the first, second, and third sets of data elements represent respective m-by-m matrices and m is greater than one; a decoder configured to decode a single instruction specifying an m-by-m matrix multiply-add operation; and an execution unit coupled to the decoder to receive decoded instructions and coupled to the storage location to execute the m-by-m matrix multiply-add operation; wherein, responsive to executing the m-by-m matrix multiply-add operation, the execution unit is configured to generate a resultant vector representing an m-by-m result matrix. - View Dependent Claims (10, 11, 12, 13, 14)
-
-
15. A system comprising:
-
a memory controller coupled to a first storage location configured to store a first set of data elements; and a processor coupled to the memory controller, the processor comprising; a register file unit configured to store a second set of data elements, and a third set of data elements, wherein the first, second, and third sets of data elements represent respective m-by-m data matrices; a decoder configured to decode a single instruction specifying an m-by-m matrix multiply-add operation; and an execution unit coupled to the decoder to receive decoded instructions and coupled to the first storage location and register file unit to execute the m-by-m matrix multiply-add operation; wherein, responsive to executing the m-by-m matrix multiply-add operation, the execution unit is configured to generate a resultant vector representing an m-by-m result matrix. - View Dependent Claims (16, 17, 18)
-
Specification