CPC G06F 9/3877 (2013.01) [G06F 9/3836 (2013.01); G06F 9/3851 (2013.01); G06F 15/7807 (2013.01); G06F 17/16 (2013.01); G06N 20/00 (2019.01); G06N 20/10 (2019.01); G06F 9/3001 (2013.01); G06F 15/7864 (2013.01); G06F 15/8023 (2013.01); G06F 2212/602 (2013.01); G06N 5/04 (2013.01); G06N 20/20 (2019.01)]
42 Claims
19. A method comprising:
receiving a set of commands for performance-critical operations in a first instruction set architecture (ISA) format from a core;
generating a second ISA format from the first ISA format;
streaming the set of commands in the second ISA format to an inference engine, wherein the set of commands in the first ISA format is an asynchronous instruction set; and
executing by the inference engine the stream of the set of commands in the second ISA format to program a plurality of components within the inference engine to perform an ML operation.
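The four steps of claim 19 can be pictured with a minimal sketch. It is an illustration under stated assumptions, not the claimed implementation: the claim defines no encodings, so the class names (`CoreCommand`, `EngineCommand`, `InferenceEngine`), the opcodes (`MATMUL`, `RELU`), the component names (`memory`, `compute`), and the one-to-many translation table are all hypothetical. The asynchronous nature of the first-format commands is not modeled; a plain Python generator stands in for the stream.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, Iterable, Iterator, List, Tuple


@dataclass(frozen=True)
class CoreCommand:
    """Hypothetical command in the first ISA format, as issued by the core."""
    opcode: str
    operands: Tuple[Any, ...] = ()


@dataclass(frozen=True)
class EngineCommand:
    """Hypothetical command in the second ISA format, consumed by the engine."""
    component: str  # which engine component this command programs
    setting: str    # which setting of that component is written
    value: Any


def translate(cmd: CoreCommand) -> List[EngineCommand]:
    """Generate second-format commands from one first-format command.

    The mapping below is invented for illustration only.
    """
    if cmd.opcode == "MATMUL":
        rows, cols = cmd.operands
        return [
            EngineCommand("memory", "tile_shape", (rows, cols)),
            EngineCommand("compute", "op", "matmul"),
        ]
    if cmd.opcode == "RELU":
        return [EngineCommand("compute", "activation", "relu")]
    raise ValueError(f"unknown opcode: {cmd.opcode}")


def stream(commands: Iterable[CoreCommand]) -> Iterator[EngineCommand]:
    """Lazily yield translated commands, one at a time, to the consumer."""
    for cmd in commands:
        yield from translate(cmd)


@dataclass
class InferenceEngine:
    """Toy engine whose components are plain dictionaries of settings."""
    components: Dict[str, Dict[str, Any]] = field(
        default_factory=lambda: {"memory": {}, "compute": {}}
    )

    def execute(self, commands: Iterable[EngineCommand]) -> None:
        """Program each addressed component as commands arrive."""
        for cmd in commands:
            self.components[cmd.component][cmd.setting] = cmd.value


if __name__ == "__main__":
    engine = InferenceEngine()
    received = [CoreCommand("MATMUL", (4, 8)), CoreCommand("RELU")]
    engine.execute(stream(received))
    print(engine.components)
```

In this sketch, `received` corresponds to the receiving step, `translate` to generating the second format, `stream` to the streaming step, and `InferenceEngine.execute` to programming the components. A generator is used so that translation and execution interleave command by command rather than in two separate batches.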