System and method for distributed graphics processing unit (GPU) computation
First Claim
1. A system comprising:
   a data processor; and
   a distributed task management module, executable by the data processor, the distributed task management module being configured to:
   receive a user task service request from a user node;
   query resource availability from a plurality of slave nodes having a plurality of graphics processing units (GPUs) thereon, the plurality of slave nodes configured with multiple GPUs mounted on distributed processing containers;
   generate a list of uniform resource locators (URLs), each URL on the list corresponding to a path to an available distributed processing container on the plurality of slave nodes;
   issue the list of URLs to a load balancing node;
   receive from the load balancing node an overall unique URL corresponding to the list of URLs;
   use the overall unique URL to assign the user task service request to a plurality of available GPUs based on the resource availability and resource requirements of the user task service request, the assigning including using available distributed processing containers on the plurality of slave nodes; and
   retain the list of URLs corresponding to the distributed processing containers assigned to the user task service request.
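The claimed flow can be sketched in code. This is a minimal illustration under assumed names: `SlaveNode`, `TaskRequest`, and the URL scheme below are hypothetical, not interfaces defined by the patent.

```python
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class SlaveNode:
    host: str
    free_containers: List[str]  # container paths that have an idle GPU mounted

@dataclass
class TaskRequest:
    task_id: str
    gpus_needed: int

def generate_url_list(nodes: List[SlaveNode]) -> List[str]:
    """Query availability and build one URL per available container."""
    return [f"http://{n.host}/{path}"
            for n in nodes for path in n.free_containers]

def assign_task(request: TaskRequest,
                url_list: List[str]) -> Dict[str, List[str]]:
    """Assign the request to enough containers; retain the assigned URLs."""
    if len(url_list) < request.gpus_needed:
        raise RuntimeError("insufficient available GPUs")
    return {request.task_id: url_list[:request.gpus_needed]}
```

For example, two slave nodes exposing three free containers yield a three-entry URL list, and a two-GPU request retains the first two assigned URLs.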
Abstract
A system and method for distributed graphics processing unit (GPU) computation are disclosed. A particular embodiment includes: receiving a user task service request from a user node; querying resource availability from a plurality of slave nodes having a plurality of graphics processing units (GPUs) thereon; assigning the user task service request to a plurality of available GPUs based on the resource availability and resource requirements of the user task service request, the assigning including starting a service on a GPU using a distributed processing container and creating a corresponding uniform resource locator (URL); and retaining a list of URLs corresponding to the resources assigned to the user task service request.
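The abstract's assignment step, starting a service on a GPU using a distributed processing container and creating a corresponding URL, can be sketched as follows. The launcher is simulated (no real container runtime is invoked), and the port and URL format are assumptions for illustration only.

```python
import itertools

_ids = itertools.count()  # monotonically increasing container ids

def start_container_service(host: str, gpu_index: int) -> str:
    """Simulate launching a containerized service on one GPU; return its URL."""
    return f"http://{host}:8080/ctr-{next(_ids)}/gpu{gpu_index}"

def assign(gpus_needed: int, free_gpus: list) -> list:
    """free_gpus holds (host, gpu_index) pairs; returns the retained URL list."""
    return [start_container_service(h, g) for h, g in free_gpus[:gpus_needed]]
```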
14 Claims
1. (Recited above as the First Claim.) Dependent claims: 2, 3, 4, 5.
6. A method comprising:
   receiving a user task service request from a user node;
   querying resource availability from a plurality of slave nodes having a plurality of graphics processing units (GPUs) thereon, the plurality of slave nodes configured with multiple GPUs mounted on distributed processing containers;
   generating a list of uniform resource locators (URLs), each URL on the list corresponding to a path to an available distributed processing container on the plurality of slave nodes;
   issuing the list of URLs to a load balancing node;
   receiving from the load balancing node an overall unique URL corresponding to the list of URLs;
   using the overall unique URL to assign the user task service request to a plurality of available GPUs based on the resource availability and resource requirements of the user task service request, the assigning including using available distributed processing containers on the plurality of slave nodes; and
   retaining the list of URLs corresponding to the distributed processing containers assigned to the user task service request.
   Dependent claims: 7, 8, 9, 10.
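The load-balancing node's role in the claimed method, accepting the issued URL list and returning one overall unique URL that fronts the containers, can be sketched as below. The `lb.local` host and the round-robin dispatch policy are assumptions for illustration; the patent does not specify a balancing policy here.

```python
import itertools
import uuid

class LoadBalancer:
    def __init__(self) -> None:
        self._pools = {}  # overall URL -> cycling iterator of container URLs

    def register(self, url_list):
        """Receive the issued URL list; return the overall unique URL for it."""
        overall = f"http://lb.local/{uuid.uuid4().hex}"
        self._pools[overall] = itertools.cycle(list(url_list))
        return overall

    def resolve(self, overall_url):
        """Map the overall URL to the next backing container URL."""
        return next(self._pools[overall_url])
```

A caller registers the URL list once, then routes every task interaction through the single overall URL while the balancer rotates among the backing containers.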
11. A non-transitory machine-useable storage medium embodying instructions which, when executed by a machine, cause the machine to:
   receive a user task service request from a user node;
   query resource availability from a plurality of slave nodes having a plurality of graphics processing units (GPUs) thereon, the plurality of slave nodes configured with multiple GPUs mounted on distributed processing containers;
   generate a list of uniform resource locators (URLs), each URL on the list corresponding to a path to an available distributed processing container on the plurality of slave nodes;
   issue the list of URLs to a load balancing node;
   receive from the load balancing node an overall unique URL corresponding to the list of URLs;
   use the overall unique URL to assign the user task service request to a plurality of available GPUs based on the resource availability and resource requirements of the user task service request, the assigning including using available distributed processing containers on the plurality of slave nodes; and
   retain the list of URLs corresponding to the distributed processing containers assigned to the user task service request.
   Dependent claims: 12, 13, 14.
Specification