Method and software for group floating-point arithmetic operations
First Claim
1. A method of processing data in a programmable processor, the method comprising:
- decoding instructions for execution using an execution unit coupled to a register file comprising a plurality of registers, the execution unit operable to execute instructions by partitioning data stored in registers in the register file into multiple data elements, the instructions selected from an instruction set that includes group arithmethic instructions and group data handling instructions;
in response to decoding different group data handling instructions, executing group data handling operations that re-arrange data elements in different ways and returning the re-arranged data to a register in the register file; and
in response to decoding different group arithmethic instructions, executing a plurality of different group floating-point and group integer arithmetic operations that each arithmetically operates on the multiple data elements stored in registers in the register file to produce a catenated result that is returned to a register in the register file, wherein the catenated result comprises a plurality of individual results, wherein the group arithmetic instructions include first, second, and third group multiply-and-add instructions each of which (i) partitions data in first and second registers in the register file into a first plurality and a second plurality of equal-sized data elements and partitions data in a third register in the register file into a third plurality of data elements which are equal in size to one another, (ii) multiplies each data element in the first register with a corresponding data element in the second register to produce a plurality of products, (iii) adds each product in the plurality of products to a corresponding data element in the third register to produce the plurality of individual results, and (iv) provides the plurality of individual results as the catenated result, wherein the first group multiply-and-add instruction multiplies data elements of 8-bit integer data and adds data elements of 16-bit integer data, the second group multiply-and-add instruction multiplies data elements of 16-bit integer data and adds data elements of 32-bit integer data, and the third group multiply-and-add instruction multiplies data elements of 32-bit floating point data and adds data elements of 32-bit floating-point data.
1 Assignment
0 Petitions
Accused Products
Abstract
Methods and software are presented for processing data in a programmable processor, involving (a) decoding instructions for execution using an execution unit operable to execute instructions by partitioning data stored in registers in a register file into multiple data elements, the instructions selected from an instruction set that includes group arithmethic instructions and group data handling instructions, (b) in response to decoding different group data handling instructions, executing group data handling operations that re-arrange data elements in different ways, and (c) in response to decoding different group arithmethic instructions, executing a plurality of different group floating-point and group integer arithmetic operations that each arithmetically operates on the multiple data elements stored in registers in the register file to produce a catenated result that is returned to a register in the register file, wherein the catenated result comprises a plurality of individual results.
-
Citations
22 Claims
-
1. A method of processing data in a programmable processor, the method comprising:
-
decoding instructions for execution using an execution unit coupled to a register file comprising a plurality of registers, the execution unit operable to execute instructions by partitioning data stored in registers in the register file into multiple data elements, the instructions selected from an instruction set that includes group arithmethic instructions and group data handling instructions;
in response to decoding different group data handling instructions, executing group data handling operations that re-arrange data elements in different ways and returning the re-arranged data to a register in the register file; and
in response to decoding different group arithmethic instructions, executing a plurality of different group floating-point and group integer arithmetic operations that each arithmetically operates on the multiple data elements stored in registers in the register file to produce a catenated result that is returned to a register in the register file, wherein the catenated result comprises a plurality of individual results, wherein the group arithmetic instructions include first, second, and third group multiply-and-add instructions each of which (i) partitions data in first and second registers in the register file into a first plurality and a second plurality of equal-sized data elements and partitions data in a third register in the register file into a third plurality of data elements which are equal in size to one another, (ii) multiplies each data element in the first register with a corresponding data element in the second register to produce a plurality of products, (iii) adds each product in the plurality of products to a corresponding data element in the third register to produce the plurality of individual results, and (iv) provides the plurality of individual results as the catenated result, wherein the first group multiply-and-add instruction multiplies data elements of 8-bit integer data and adds data elements of 16-bit integer data, the second group multiply-and-add instruction multiplies data elements of 16-bit integer data and adds data elements of 32-bit integer data, and the third group multiply-and-add instruction multiplies data elements of 32-bit floating point data and adds data elements of 32-bit floating-point data. - View Dependent Claims (2, 3, 4, 5, 6, 7, 8, 9)
-
-
10. A method of processing data in a programmable processor, the method comprising:
-
decoding instructions for execution using an execution unit coupled to a register file comprising a plurality of registers, the execution unit operable to execute instructions by partitioning data stored in operand registers in the register file into multiple data elements, the instructions selected from an instruction set that includes group arithmethic instructions and group data handling instructions;
in response to decoding different group data handling instructions, executing group data handling operations that re-arrange data elements in different ways and returning the re-arranged data to a register in the register file; and
in response to decoding different group arithmethic instructions, executing a plurality of different group floating-point and group integer arithmetic operations that each arithmetically operates on the multiple data elements stored in registers in the register file to produce a catenated result that is returned to a register in the register file, wherein the catenated result comprises a plurality of individual results, wherein the group arithmetic instructions include first, second, and third group multiply instructions, each of which (i) partitions data in first and second registers in the register file into a first plurality and a second plurality of equal-sized data elements, (ii) multiplies each data element in the first register with a corresponding data element in the second register to produce the plurality of individual results, which are equal in size to one another, (iii) provides the plurality of individual results as the catenated result, wherein the first group multiply instruction operates on data elements of 8-bit integer data and produces data elements of 16-bit integer data, the second group multiply instruction operates on data elements of 16-bit integer data and produces data elements of 32-bit integer data, and the third group multiply instruction operates on data elements of 32-bit floating-point data and produces data elements of 32-bit floating-point data. - View Dependent Claims (11)
-
-
12. A computer-readable storage medium:
-
having instructions stored thereon that instruct a computer system to perform operations using an execution unit coupled to a register file comprising a plurality of registers, by partitioning data stored in operand registers in the register file into multiple data elements, the instructions selected from an instruction set that includes group arithmethic instructions and group data handling instructions, the instructions including different group data handling instructions that cause the execution unit to re-arrange data elements in different ways and returning the re-arranged data to a register in the register file, the instructions further including different group arithmetic instructions for performing a plurality of different group floating-point and group integer arithmetic operations that each arithmetically operates on the multiple data elements stored in registers in the register file to produce a catenated result that is returned to a register in the register file, wherein the catenated result comprises a plurality of individual results, wherein the group arithmetic instructions include first, second, and third group multiply-and-add instructions each of which (i) partitions data in first and second registers in the register file into a first plurality and a second plurality of equal-sized data elements and partitions data in a third register in the register file into a third plurality of data elements which are equal in size to one another, (ii) multiplies each data element in the first register with a corresponding data element in the second register to produce a plurality of products, (iii) adds each product in the plurality of products to a corresponding data element in the third register to produce the plurality of individual results, and (iv) provides the plurality of individual results as the catenated result, wherein the first group multiply-and-add instruction multiplies data elements of 8-bit integer data and adds data elements of 16-bit integer data, the second group multiply-and-add instruction multiplies data elements of 16-bit integer data and adds data elements of 32-bit integer data, and the third group multiply-and-add instruction multiplies data elements of 32-bit floating point data and adds data elements of 32-bit floating-point data. - View Dependent Claims (13, 14, 15)
-
- 16. The computer-readable storage medium 15 wherein the group arithmetic instructions further include a group negate instruction that partitions a first register in the register file into a first plurality of equal-sized data elements and applies a negation function to each data element in the first register to produce a plurality of negated data elements and provide the plurality of negated data elements as the catenated result, wherein the group negate instruction operates on data elements of 32-bit floating-point data.
-
21. A computer-readable storage medium:
-
having instructions stored thereon that instruct a computer system to perform operations using an execution unit coupled to a register file comprising a plurality of registers, by partitioning data stored in operand registers in the register file into multiple data elements, the instructions selected from an instruction set that includes group arithmethic instructions and group data handling instructions, the instructions including different group data handling instructions that cause the execution unit to re-arrange data elements in different ways returning the re-arranged data to a register in the register file, the instructions further including different group arithmetic instructions for performing a plurality of different group floating-point and group integer arithmetic operations that each arithmetically operates on the multiple data elements stored in registers in the register file to produce a catenated result that is returned to a register in the register file, wherein the catenated result comprises a plurality of individual results, wherein the group arithmetic instructions include first, second, and third group multiply instructions, each of which (i) partitions data in first and second registers in the register file into a first plurality and a second plurality of equal-sized data elements, (ii) multiplies each data element in the first register with a corresponding data element in the second register to produce the plurality of individual results, which are equal in size to one another, (iii) provides the plurality of individual results as the catenated result, wherein the first group multiply instruction operates on data elements of 8-bit integer data and produces data elements of 16-bit integer data, the second group multiply instruction operates on data elements of 16-bit integer data and produces data elements of 32-bit integer data, and the third group multiply instruction operates on data elements of 32-bit floating-point data and produces data elements of 32-bit floating-point data. - View Dependent Claims (22)
-
Specification