Processing device with intuitive learning capability
First Claim
47. A processing device having one or more objectives, comprising:
- a probabilistic learning module having a learning automaton configured for learning a plurality of processor actions in response to a plurality of actions performed by a user; and
an intuition module configured for modifying a functionality of said probabilistic learning module based on said one or more objectives.
1 Assignment
0 Petitions
Accused Products
Abstract
A method and apparatus for providing learning capability to processing device, such as a computer game, educational toy, telephone, or television remote control, is provided to achieve one or more objective. For example, if the processing device is a computer game, the objective may be to match the skill level of the game with that of a player. If the processing device is an educational toy, the objective may be to increase the educational level of a user. If the processing device is a telephone, the objective may be to anticipate the phone numbers that a phone user will call. If the processing device is a television remote control, the objective may be to anticipate the television channels that will watched by the user. One of a plurality of actions (e.g., game actions, educational prompts, listed phone numbers, or listed television channels) to be performed on the processing device is selected. A user input indicative of a user action (e.g., a player action, educational input, called phone number, or watched television channel) is received. An outcome of the selected action and/or user action is determined. For example, in the case of a computer game, the outcome may indicate whether a computer-manipulated object has intersected a user-manipulated object. In the case of an educational toy, the outcome may indicate whether a user action matches a prompt generated by the educational toy. In the case of a telephone, the outcome may indicate whether a called phone number is on a list of phone numbers. In the case of a television remote control, the outcome may indicate whether a watched television channel is on a list of television channels. An action probability distribution that includes probability values corresponding to the plurality of actions is then updated based on the determined outcome. The next action will then be selected based on this updated action probability distribution. The foregoing steps can be modified based on a performance index to achieve the objective of the processing device so that it learns.
91 Citations
1037 Claims
-
47. A processing device having one or more objectives, comprising:
-
a probabilistic learning module having a learning automaton configured for learning a plurality of processor actions in response to a plurality of actions performed by a user; and
an intuition module configured for modifying a functionality of said probabilistic learning module based on said one or more objectives. - View Dependent Claims (48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64)
-
-
65. A method of providing learning capability to a computer game having an objective of matching a skill level of said computer game with a skill level of a game player, comprising:
-
identifying a move performed by said game player;
selecting one of a plurality of game moves based on a game move probability distribution comprising a plurality of probability values corresponding to said plurality of game moves;
determining an outcome of said selected game move relative to said identified player move;
updating said game move probability distribution based on said outcome; and
modifying one or more of said game move selection, said outcome determination, and said game move probability distribution update based on said objective. - View Dependent Claims (66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106)
-
-
107. A computer game having an objective of matching a skill level of said computer game with a skill level of a game player, comprising:
-
a probabilistic learning module configured for learning a plurality of game moves in response to a plurality of moves performed by a game player; and
an intuition module configured for modifying a functionality of said probabilistic learning module based on said objective. - View Dependent Claims (108, 109, 110, 111, 114, 121)
-
-
122. A method of providing learning capability to a processing device, comprising:
-
generating an action probability distribution comprising a plurality of probability values organized among a plurality of action subsets, said plurality of probability values corresponding to a plurality of processor actions;
selecting one of said plurality of action subsets; and
selecting one of said plurality of processor actions from said selected action subset. - View Dependent Claims (123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140)
-
-
141. A method of providing learning capability to a computer game, comprising:
-
generating a game move probability distribution comprising a plurality of probability values organized among a plurality of game move subsets, said plurality of probability values corresponding to a plurality of game moves;
selecting one of said plurality of game move subsets; and
selecting one of said plurality of game moves from said selected game move subset. - View Dependent Claims (142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174)
-
-
175. A method of providing learning capability to a processing device, comprising:
-
generating an action probability distribution using one or more learning algorithms, said action probability distribution comprising a plurality of probability values corresponding to a plurality of processor actions;
modifying said one or more learning algorithms; and
updating said action probability distribution using said modified one or more learning algorithms. - View Dependent Claims (176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198)
-
-
199. A method of providing learning capability to a computer game, comprising:
-
generating a game move probability distribution using one or more learning algorithms, said game move probability distribution comprising a plurality of probability values corresponding to a plurality of game moves;
modifying said one or more learning algorithms; and
updating said game move probability distribution using said modified one or more learning algorithms. - View Dependent Claims (200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236)
-
-
237. A method of matching a skill level of game player with a skill level of a computer game, comprising:
-
identifying a move performed by said game player;
selecting one of a plurality of game moves based on a game move probability distribution comprising a plurality of probability values corresponding to said plurality of game moves;
determining if said selected game move is successful;
determining a current skill level of said game player relative to a current skill level of said computer game; and
updating said game move probability distribution using a reward algorithm if said selected game move is successful and said relative skill level is relatively high, or if said selected game move is unsuccessful and said relative skill level is relatively low. - View Dependent Claims (238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248)
-
-
249. A method of matching a skill level of game player with a skill level of a computer game, comprising:
-
identifying a move performed by said game player;
selecting one of a plurality of game moves based on a game move probability distribution comprising a plurality of probability values corresponding to said plurality of game moves;
determining if said selected game move is successful;
determining a current skill level of said game player relative to a current skill level of said computer game; and
updating said game move probability distribution using a penalty algorithm if said selected game move is unsuccessful and said relative skill level is relatively high, or if said selected game move is successful and said relative skill level is relatively low. - View Dependent Claims (250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260)
-
-
261. A method of matching a skill level of game player with a skill level of a computer game, comprising:
-
identifying a move performed by said game player;
selecting one of a plurality of game moves based on a game move probability distribution comprising a plurality of probability values corresponding to said plurality of game moves;
determining if said selected game move is successful;
determining a current skill level of said game player relative to a current skill level of said computer game;
updating said game move probability distribution using a reward algorithm if said selected game move is successful and said relative skill level is relatively high, or if said selected game move is unsuccessful and said relative skill level is relatively low; and
updating said game move probability distribution using a penalty algorithm if said selected game move is unsuccessful and said relative skill level is relatively high, or if said selected game move is successful and said relative skill level is relatively low. - View Dependent Claims (262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272)
-
-
273. A method of matching a skill level of game player with a skill level of a computer game, comprising:
-
identifying a move performed by said game player;
selecting one of a plurality of game moves based on a game move probability distribution comprising a plurality of probability values corresponding to said plurality of game moves;
determining if said selected game move is successful;
determining a current skill level of said game player relative to a current skill level of said computer game;
generating a successful outcome if said selected game move is successful and said relative skill level is relatively high, or if said selected game move is unsuccessful and said relative skill level is relatively low; and
updating said game move probability distribution based on said successful outcome. - View Dependent Claims (274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284)
-
-
285. A method of matching a skill level of game player with a skill level of a computer game, comprising:
-
identifying a move performed by said game player;
selecting one of a plurality of game moves based on a game move probability distribution comprising a plurality of probability values corresponding to said plurality of game moves;
determining if said selected game move is successful;
determining a current skill level of said game player relative to a current skill level of said computer game;
generating an unsuccessful outcome if said selected game move is unsuccessful and said relative skill level is relatively high, or if said selected game move is successful and said relative skill level is relatively low; and
updating said game move probability distribution based on said unsuccessful outcome. - View Dependent Claims (286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296)
-
-
297. A method of matching a skill level of game player with a skill level of a computer game, comprising:
-
identifying a move performed by said game player;
selecting one of a plurality of game moves based on a game move probability distribution comprising a plurality of probability values corresponding to said plurality of game moves;
determining if said selected game move is successful;
determining a current skill level of said game player relative to a current skill level of said computer game;
generating a successful outcome if said selected game move is successful and said relative skill level is relatively high, or if said selected game move is successful and said relative skill level is relatively low;
generating an unsuccessful outcome if said selected game move is unsuccessful and said relative skill level is relatively high, or if said selected game move is successful and said relative skill level is relatively low; and
updating said game move probability distribution based on said successful outcome and said unsuccessful outcome. - View Dependent Claims (298, 299, 300, 301, 302, 303, 304, 305, 306, 307, 308)
-
-
309. A method of providing learning capability to a processing device, comprising:
-
generating an action probability distribution comprising a plurality of probability values corresponding to a plurality of processor actions; and
transforming said action probability distribution. - View Dependent Claims (310, 311, 312, 313, 314, 315, 316, 317, 318, 319)
-
-
320. A method of providing learning capability to a computer game, comprising:
-
generating a game move probability distribution comprising a plurality of probability values corresponding to a plurality of game moves; and
transforming said game move probability distribution. - View Dependent Claims (321, 322, 323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340)
-
-
341. A method of providing learning capability to a processing device, comprising:
-
generating an action probability distribution comprising a plurality of probability values corresponding to a plurality of processor actions; and
limiting one or more of said plurality of probability values. - View Dependent Claims (342, 343, 344, 345, 346, 347, 348, 349, 350, 351)
-
-
352. A method of providing learning capability to a computer game, comprising:
-
generating a game move probability distribution comprising a plurality of probability values corresponding to a plurality of game moves; and
limiting one or more of said plurality of probability values. - View Dependent Claims (353, 354, 355, 356, 357, 358, 359, 360, 361, 362, 363, 364, 365)
-
-
366. A method of providing learning capability to a processing device, comprising:
-
identifying an action performed by a user;
selecting one of a plurality of processor actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions;
determining an outcome of one or both of said identified user action and said selected processor action;
updating said action probability distribution based on said outcome; and
wherein said action probability distribution is prevented from substantially converging to a single probability value. - View Dependent Claims (367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393)
-
-
394. A processing device, comprising:
-
a probabilistic learning module configured for learning a plurality of processor actions in response to a plurality of actions performed by a user; and
an intuition module configured for preventing said probabilistic learning module from substantially converging to a single processor action. - View Dependent Claims (395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406)
-
-
407-1. The method of claim 407, wherein said outcome can be represented by one of two possible values.
-
408-2. The method of claim 409, wherein said two possible values are the integers “
- zero” and
“
one.”
- zero” and
-
409-3. The method of claim 407, wherein said outcome can be represented by one of a range of continuous values.
-
429. A processing device having a function independent of determining an optimum action, comprising:
-
an action selection module configured for selecting one of a plurality of processor actions, said action selection being based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions, wherein said selected processor action affects said processing device function;
an outcome evaluation module configured for determining an outcome of one or both of said identified user action and said selected processor action; and
a probability update module configured for updating said action probability distribution based on said outcome. - View Dependent Claims (430, 431, 432, 433, 434, 435, 436, 437)
-
-
438. A method of providing learning capability to a processing device having one or more objectives, comprising:
-
identifying actions from a plurality of users;
selecting one or more of a plurality of processor actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions;
determining one or more outcomes of one or both of said identified plurality of user actions and said selected one or more processor actions;
updating said action probability distribution using one or more learning automatons based on said one or more outcomes; and
modifying one or more of said processor action selection, said outcome determination, and said action probability distribution update based on said one or more objectives. - View Dependent Claims (439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460)
-
-
461. A method of providing learning capability to a processing device having one or more objectives, comprising:
-
identifying actions from users divided amongst a plurality of user sets;
for each of said user sets;
selecting one or more of a plurality of processor actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions;
determining one or more outcomes of one or more actions from said each user set and said selected one or more processor actions;
updating said action probability distribution using a learning automaton based on said one or more outcomes; and
modifying one or more of said processor action selection, said outcome determination, and said action probability distribution update based on said one or more objectives. - View Dependent Claims (462, 463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, 478, 479, 480, 481, 482, 483)
-
-
484. A processing device having one or more objectives, comprising:
-
a probabilistic learning module having a learning automaton configured for learning a plurality of processor actions in response to a plurality of actions performed by a plurality of users; and
an intuition module configured for modifying a functionality of said probabilistic learning module based on said one or more objectives. - View Dependent Claims (485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, 500, 501, 502, 503, 504, 505, 506, 507, 508, 509, 510, 511, 512, 513, 514, 515, 516, 517, 518)
-
-
519. A method of providing learning capability to a processing device having one or more objectives, comprising:
-
identifying a plurality of user actions;
selecting one or more of a plurality of processor actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions;
weighting said identified plurality of user actions;
determining one or more outcomes of said plurality of weighted user actions; and
updating said action probability distribution based on said one or more outcomes. - View Dependent Claims (520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538)
-
-
539. A processing device having one or more objectives, comprising:
-
an action selection module configured for selecting one or more of a plurality of processor actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions;
an outcome evaluation module configured for weighting a plurality of identified user actions, and for determining one or more outcomes of said plurality of weighted user actions; and
a probability update module configured for updating said action probability distribution based on said outcome. - View Dependent Claims (540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554)
-
-
555. A method of providing learning capability to a processing device having one or more objectives, comprising:
-
identifying a plurality of user actions;
selecting one of a plurality of processor actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions;
determining a success ratio of said selected processor action relative to said identified plurality of user actions;
comparing said determined success ratio to a reference success ratio;
determining an outcome of said success ratio comparison; and
updating said action probability distribution based on said outcome. - View Dependent Claims (556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 568, 569, 570, 571, 572, 573)
-
-
574. A processing device having one or more objectives, comprising:
-
an action selection module configured for selecting one of a plurality of processor actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions;
an outcome evaluation module configured for determining a success ratio of said selected processor action relative to a plurality of user actions, for comparing said determined success ratio to a reference success ratio, and for determining an outcome of said success ratio comparison; and
a probability update module configured for updating said action probability distribution based on said outcome. - View Dependent Claims (575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590)
-
-
591. A method of providing learning capability to a processing device having one or more objectives, comprising:
-
identifying actions from a plurality of users;
selecting one of a plurality of processor actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions;
determining if said selected processor action has a relative success level for a majority of said plurality of users;
determining an outcome of said success determination; and
updating said action probability distribution based on said outcome. - View Dependent Claims (592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607)
-
-
608. A processing device having one or more objectives, comprising:
-
an action selection module configured for selecting one of a plurality of processor actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions;
an outcome evaluation module configured for determining if said selected processor action has a relative success level for a majority of a plurality of users, and for determining an outcome of said success determination; and
a probability update module configured for updating said action probability distribution based on said outcome. - View Dependent Claims (609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 620, 621, 622)
-
-
623. A method of providing learning capability to a processing device having one or more objectives, comprising:
-
selecting one or more of a plurality of processor actions that are respectively linked to a plurality of user parameters, said selection being based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of linked processor actions;
linking said one or more selected process actions with one or more of said plurality of user parameters;
determining one or more outcomes of said one or more linked processor actions; and
updating said action probability distribution based on said one or more outcomes. - View Dependent Claims (624, 625, 626, 627, 628, 629, 630, 631, 632, 633, 634, 635, 636, 637, 638, 639, 640, 641, 642, 643)
-
-
644. A processing device having one or more objectives, comprising:
-
an action selection module configured for selecting one or more of a plurality of processor actions that are respectively linked to a plurality of user parameters, said selection being based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of linked processor actions;
an outcome evaluation module configured for linking said one or more selected process actions with one or more of said plurality of user parameters, and for determining one or more outcomes of said one or more linked processor actions and one or more user actions; and
a probability update module configured for updating said action probability distribution based on said one or more outcomes. - View Dependent Claims (645, 646, 647, 648, 649, 650, 651, 652, 653, 654, 655, 656, 657, 658, 659, 660, 661, 662)
-
-
663. A method of providing learning capability to a processing device having an objective, comprising:
-
generating a list containing a plurality of listed items with an associated item probability distribution comprising a plurality of probability values corresponding to said plurality of listed items;
selecting one or more items from said plurality of listed items based on said item probability distribution;
determining a performance index indicative of a performance of said processing device relative to said objective; and
modifying said item probability distribution based on said performance index. - View Dependent Claims (664, 665, 666, 667, 668, 669, 670, 671, 672, 673, 674, 675, 676, 677, 678, 679, 680, 681, 682, 683, 684, 685, 686, 687, 688, 689, 690, 691, 692, 693, 694, 695, 696, 697, 698, 699)
-
-
700. A processing device having an objective, comprising:
-
a probabilistic learning module configured for learning a plurality of favorite items of a user in response to identified user items; and
an intuition module configured for modifying a functionality of said probabilistic learning module based on said objective. - View Dependent Claims (701, 702, 703, 704, 705, 706, 707, 708, 709, 710, 711, 712, 713, 714, 715, 716, 717, 718, 719, 720, 721, 722, 723, 724, 725, 726, 727)
-
-
728. A method of providing learning capability to a processing device having an objective, comprising:
-
generating a plurality of lists respectively corresponding to a plurality of item parameter values, each of said plurality of lists containing a plurality of listed items with an associated item probability distribution comprising a plurality of probability values corresponding to said plurality of listed items;
selecting a list corresponding to a parameter value exhibited by a currently identified action associated item; and
in said selected list, selecting one or more listed items from said plurality of listed items based on said item probability distribution;
determining a performance index indicative of a performance of said processing device relative to said objective; and
modifying said item probability distribution based on said performance index. - View Dependent Claims (729, 730, 731, 732, 733, 734, 735, 736, 737, 738, 739, 740, 741, 742, 743, 744, 745, 746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756)
-
-
757. A method of providing learning capability to a phone number calling system having an objective of anticipating called phone numbers, comprising:
-
generating a phone list containing at least a plurality of listed phone numbers and a phone number probability distribution comprising a plurality of probability values corresponding to said plurality of listed phone numbers;
selecting a set of phone numbers from said plurality of listed phone numbers based on said phone number probability distribution;
determining a performance index indicative of a performance of said phone number calling system relative to said objective; and
modifying said phone number probability distribution based on said performance index. - View Dependent Claims (758, 759, 760, 761, 762, 763, 764, 765, 766, 767, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779, 780, 781, 782, 783, 784, 785, 786, 787, 788, 789, 790, 791, 792, 793, 794, 795, 796)
-
-
797. A phone number calling system having an objective of anticipating called phone numbers, comprising:
-
a probabilistic learning module configured for learning favorite phone numbers of a user in response to phone calls; and
an intuition module configured for modifying a functionality of said probabilistic learning module based on said objective. - View Dependent Claims (798, 799, 800, 801, 802, 803, 804, 805, 806, 807, 808, 809, 810, 811, 812, 813, 814, 815, 816, 817, 818, 819, 820, 821, 822, 823, 824, 825, 826, 827, 828, 829, 830)
-
-
831. A method of providing learning capability to a phone number calling system, comprising:
-
identifying a plurality of phone numbers associated with a plurality of phone calls;
maintaining a phone list containing said plurality of phone numbers and a plurality of priority values respectively associated with said plurality of phone numbers;
selecting a set of phone numbers from said plurality of listed phone numbers based on said plurality of priority values;
communicating said phone number set to a user. - View Dependent Claims (832, 833, 834, 835, 836, 837, 838, 839, 840, 841, 842, 843)
-
-
844. A method of providing learning capability to a television channel control system having an objective of anticipating watched television channels, comprising:
-
generating a list containing a plurality of listed television channels with an associated television channel probability distribution comprising a plurality of probability values corresponding to said plurality of listed television channels;
selecting one or more television channels from said plurality of listed television channels based on said television channel probability distribution;
determining a performance index indicative of a performance of said processing device relative to said objective; and
modifying said television channel probability distribution based on said performance index. - View Dependent Claims (845, 846, 847, 848, 849, 850, 851, 852, 853, 854, 855, 856, 857, 858, 859, 860, 861, 862, 863, 864, 865, 866, 867, 868, 869, 870, 871, 872, 873, 874, 875, 876, 877, 878, 879, 880)
-
-
881. A television channel control system having an objective of anticipating watched television channels, comprising:
-
a probabilistic learning module configured for learning favorite television channels of a user in response to watched television channels by said user; and
an intuition module configured for modifying a functionality of said probabilistic learning module based on said objective. - View Dependent Claims (882, 883, 884, 885, 886, 887, 888, 889, 890, 891, 892, 893, 894, 895, 896, 897, 898, 899, 900, 901, 902, 903, 904, 905, 906)
-
-
907. A method of providing learning capability to a television channel control system, comprising:
-
generating a plurality of lists respectively associated with a plurality of television channel parameter values, each of said plurality of lists containing a plurality of listed television channels with an associated television channel probability distribution comprising a plurality of probability values corresponding to said plurality of listed television channels;
selecting a list corresponding to a television channel parameter value exhibited by a currently watched television channel; and
in said selected list, selecting one or more listed actions from said plurality of listed actions based on said action probability distribution;
determining a performance index indicative of a performance of said processing device relative to said objective; and
modifying said action probability distribution based on said performance index. - View Dependent Claims (908, 909, 910, 911, 912, 913, 914, 915, 916, 917, 918, 919, 920, 921, 922, 923, 924, 925, 926, 927, 928, 929, 930, 931, 932, 933, 934, 935, 936, 937)
-
-
938. A method of providing learning capability to an educational toy, comprising:
-
selecting one of a plurality of toy actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of toy actions, said plurality of toy actions being associated with a plurality of different difficulty levels;
identifying an action performed by a user;
determining an outcome of said selected toy action relative to said identified user action; and
updating said action probability distribution based on said outcome and said difficulty level of said selected toy action. - View Dependent Claims (939, 940, 941, 942, 943, 944, 945, 946, 947, 948, 949, 950, 951, 952, 953, 954, 955, 956)
-
-
957. A method of providing learning capability to an educational toy having an objective of increasing an educational level of a user, comprising:
-
selecting one of a plurality of toy actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of toy actions, said plurality of toy actions being associated with a plurality of different difficulty levels;
identifying an action performed by said user;
determining an outcome of said selected toy action relative to said identified user action;
updating said action probability distribution based on said outcome; and
modifying one or more of said toy action selection, outcome determination, and action probability distribution update based on said objective. - View Dependent Claims (958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971)
-
-
972. An educational toy having an objective of increasing an educational level of a user, comprising:
-
a probabilistic learning module configured for learning a plurality of toy actions in response to a plurality of actions performed by a user; and
an intuition module configured for modifying a functionality of said probabilistic learning module based on said objective. - View Dependent Claims (973, 974, 975, 976, 977, 978, 979, 980, 981, 982, 983, 984)
-
-
985. A method of providing learning capability to a processing device, comprising:
-
selecting one of a plurality of processor actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions, said plurality of processor actions being associated with a plurality of different difficulty levels;
identifying an action performed by a user;
determining an outcome of said selected processor action relative to said identified user action; and
updating said action probability distribution based on said outcome and said difficulty level of said selected processor action. - View Dependent Claims (986, 987, 988, 989, 990, 991, 992, 993, 994, 995, 996, 997, 998, 999, 1000, 1001, 1002, 1003, 1004)
-
-
1005. A method of providing learning capability to a processing device having one or more objectives, comprising:
-
selecting one of a plurality of processor actions based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions, said plurality of processor actions being associated with a plurality of different difficulty levels;
identifying an action performed by said user;
determining an outcome of said selected processor action relative to said identified user action;
updating said action probability distribution based on said outcome; and
modifying one or more of said action selection, outcome determination, and action probability distribution update based on said objective. - View Dependent Claims (1006, 1007, 1008, 1009, 1010, 1011, 1012, 1013, 1014, 1015, 1016, 1017, 1018, 1019, 1020)
-
-
1021. A processing device having one or more objectives, comprising:
-
an action selection module configured for selecting one of a plurality of processor actions, said action selection being based on an action probability distribution comprising a plurality of probability values corresponding to said plurality of processor actions, said plurality of processor actions being associated with a plurality of different difficulty levels;
an outcome evaluation module configured for determining an outcome of said selected processor action relative to said user action; and
a probability update module configured for updating said action probability distribution based on said outcome and said difficulty level of said selected processor action. - View Dependent Claims (1022, 1023, 1024, 1025, 1026, 1027, 1028, 1029, 1030, 1031, 1032, 1033, 1034, 1035, 1036, 1037)
-
Specification