90cf3dd37c
- Add IPreparedModel::dispatchRequest to NNAPI 1.3 HAL - Add IDispatchExecutionCallback to allow clients query information related to the actual evaluation. Bug: 142778241 Test: mm Change-Id: I87cbb7f2aee87342b0418fce04eb4050e2bc1920
/*
 * Copyright (C) 2019 The Android Open Source Project
 *
 * Licensed under the Apache License, Version 2.0 (the "License");
 * you may not use this file except in compliance with the License.
 * You may obtain a copy of the License at
 *
 *      http://www.apache.org/licenses/LICENSE-2.0
 *
 * Unless required by applicable law or agreed to in writing, software
 * distributed under the License is distributed on an "AS IS" BASIS,
 * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
 * See the License for the specific language governing permissions and
 * limitations under the License.
 */

package android.hardware.neuralnetworks@1.3;

import @1.2::IPreparedModel;
import @1.2::MeasureTiming;
import @1.2::OutputShape;
import @1.2::Timing;
import ErrorStatus;
import OptionalTimePoint;
import Request;
import IExecutionCallback;
import IFencedExecutionCallback;

/**
 * IPreparedModel describes a model that has been prepared for execution and
 * is used to launch executions.
 */
interface IPreparedModel extends @1.2::IPreparedModel {
    /**
     * Launches an asynchronous execution on a prepared model.
     *
     * The execution is performed asynchronously with respect to the caller.
     * execute_1_3 must verify the inputs to the function are correct, and the usages
     * of memory pools allocated by IDevice::allocate are valid. If there is
     * an error, execute_1_3 must immediately invoke the callback with the
     * appropriate ErrorStatus value, then return with the same ErrorStatus. If
     * the inputs to the function are valid and there is no error, execute_1_3 must
     * launch an asynchronous task to perform the execution in the background,
     * and immediately return with ErrorStatus::NONE. If the asynchronous task
     * fails to launch, execute_1_3 must immediately invoke the callback with
     * ErrorStatus::GENERAL_FAILURE, then return with
     * ErrorStatus::GENERAL_FAILURE.
     *
     * When the asynchronous task has finished its execution, it must
     * immediately invoke the callback object provided as an input to the
     * execute_1_3 function. This callback must be provided with the ErrorStatus of
     * the execution.
     *
     * If the launch is successful, the caller must not change the content of
     * any data object referenced by 'request' (described by the
     * {@link @1.0::DataLocation} of a {@link @1.0::RequestArgument}) until the
     * asynchronous task has invoked the callback object. The asynchronous task
     * must not change the content of any of the data objects corresponding to
     * 'request' inputs.
     *
     * If the prepared model was prepared from a model wherein all tensor
     * operands have fully specified dimensions, and the inputs to the function
     * are valid, then:
     * - the execution should launch successfully (ErrorStatus::NONE): There
     *   must be no failure unless the device itself is in a bad state.
     * - if at execution time every operation's input operands have legal
     *   values, the execution should complete successfully (ErrorStatus::NONE):
     *   There must be no failure unless the device itself is in a bad state.
     *
     * execute_1_3 can be called with an optional deadline. If the execution
     * cannot be completed before the provided deadline, the execution
     * must be aborted, and either {@link
     * ErrorStatus::MISSED_DEADLINE_TRANSIENT} or {@link
     * ErrorStatus::MISSED_DEADLINE_PERSISTENT} must be returned. The error due
     * to an abort must be sent the same way as other errors, described above.
     * If the service reports that it does not support execution deadlines via
     * IDevice::supportsDeadlines, and execute_1_3 is called with a deadline,
     * then the argument is invalid, and {@link ErrorStatus::INVALID_ARGUMENT}
     * must be returned.
     *
     * Any number of calls to the execute* and executeSynchronously* functions,
     * in any combination, may be made concurrently, even on the same
     * IPreparedModel object.
     *
     * @param request The input and output information on which the prepared
     *                model is to be executed.
     * @param measure Specifies whether or not to measure duration of the execution.
     *                The duration runs from the time the driver sees the call
     *                to the execute_1_3 function to the time the driver invokes
     *                the callback.
     * @param deadline The time by which execution must complete. If the
     *                 execution cannot be finished by the deadline, the
     *                 execution must be aborted.
     * @param callback A callback object used to return the error status of
     *                 the execution, shape information of model output operands, and
     *                 duration of execution. The callback object's notify function must
     *                 be called exactly once, even if the execution was
     *                 unsuccessful.
     * @return status Error status of the call, must be:
     *                - NONE if task is successfully launched
     *                - DEVICE_UNAVAILABLE if driver is offline or busy
     *                - GENERAL_FAILURE if there is an unspecified error
     *                - OUTPUT_INSUFFICIENT_SIZE if provided output buffer is
     *                  not large enough to store the resultant values
     *                - INVALID_ARGUMENT if one of the input arguments is
     *                  invalid
     *                - MISSED_DEADLINE_* if the deadline for executing a model
     *                  cannot be met
     *                - RESOURCE_EXHAUSTED_* if the task was aborted by the
     *                  driver
     */
    execute_1_3(Request request, MeasureTiming measure, OptionalTimePoint deadline,
                IExecutionCallback callback)
        generates (ErrorStatus status);

    /**
     * Performs a synchronous execution on a prepared model.
     *
     * The execution is performed synchronously with respect to the caller.
     * executeSynchronously_1_3 must verify the inputs to the function are
     * correct, and the usages of memory pools allocated by IDevice::allocate
     * are valid. If there is an error, executeSynchronously_1_3 must immediately
     * return with the appropriate ErrorStatus value. If the inputs to the
     * function are valid and there is no error, executeSynchronously_1_3 must
     * perform the execution, and must not return until the execution is
     * complete.
     *
     * The caller must not change the content of any data object referenced by
     * 'request' (described by the {@link @1.0::DataLocation} of a
     * {@link @1.0::RequestArgument}) until executeSynchronously_1_3
     * returns. executeSynchronously_1_3 must not change the content of any of the
     * data objects corresponding to 'request' inputs.
     *
     * If the prepared model was prepared from a model wherein all tensor
     * operands have fully specified dimensions, and the inputs to the function
     * are valid, and at execution time every operation's input operands have
     * legal values, then the execution should complete successfully
     * (ErrorStatus::NONE): There must be no failure unless the device itself is
     * in a bad state.
     *
     * executeSynchronously_1_3 can be called with an optional deadline. If the
     * execution cannot be completed before the provided deadline, the
     * execution must be aborted, and either {@link
     * ErrorStatus::MISSED_DEADLINE_TRANSIENT} or {@link
     * ErrorStatus::MISSED_DEADLINE_PERSISTENT} must be returned. The error due
     * to an abort must be sent the same way as other errors, described above.
     * If the service reports that it does not support execution deadlines via
     * IDevice::supportsDeadlines, and executeSynchronously_1_3 is called with a
     * deadline, then the argument is invalid, and
     * {@link ErrorStatus::INVALID_ARGUMENT} must be returned.
     *
     * Any number of calls to the execute* and executeSynchronously* functions,
     * in any combination, may be made concurrently, even on the same
     * IPreparedModel object.
     *
     * @param request The input and output information on which the prepared
     *                model is to be executed.
     * @param measure Specifies whether or not to measure duration of the execution.
     *                The duration runs from the time the driver sees the call
     *                to the executeSynchronously_1_3 function to the time the driver
     *                returns from the function.
     * @param deadline The time by which execution must complete. If the
     *                 execution cannot be finished by the deadline, the
     *                 execution must be aborted.
     * @return status Error status of the execution, must be:
     *                - NONE if execution is performed successfully
     *                - DEVICE_UNAVAILABLE if driver is offline or busy
     *                - GENERAL_FAILURE if there is an unspecified error
     *                - OUTPUT_INSUFFICIENT_SIZE if at least one output
     *                  operand buffer is not large enough to store the
     *                  corresponding output
     *                - INVALID_ARGUMENT if one of the input arguments is
     *                  invalid
     *                - MISSED_DEADLINE_* if the deadline for executing a model
     *                  cannot be met
     *                - RESOURCE_EXHAUSTED_* if the task was aborted by the
     *                  driver
     * @return outputShapes A list of shape information of model output operands.
     *                      The index into "outputShapes" corresponds to the index
     *                      of the output operand in the Request outputs vector.
     *                      outputShapes must be empty unless the status is either
     *                      NONE or OUTPUT_INSUFFICIENT_SIZE.
     * @return timing Duration of execution. Unless measure is YES and status is
     *                NONE, all times must be reported as UINT64_MAX. A driver may
     *                choose to report any time as UINT64_MAX, indicating that
     *                measurement is not available.
     */
    executeSynchronously_1_3(Request request, MeasureTiming measure,
                             OptionalTimePoint deadline)
        generates (ErrorStatus status, vec<OutputShape> outputShapes,
                   Timing timing);

    /**
     * Launches a fenced asynchronous execution on a prepared model.
     *
     * The execution is performed asynchronously with respect to the caller.
     * executeFenced must fully validate the request, and only accept one that is
     * guaranteed to be completed, unless a hardware failure or kernel panic happens on the device.
     * If there is an error during validation, executeFenced must immediately return with
     * the corresponding ErrorStatus. If the request is valid and there is no error launching,
     * executeFenced must dispatch an asynchronous task to perform the execution in the
     * background, and immediately return with ErrorStatus::NONE, a sync fence that will be
     * signaled once the execution is completed, and a callback that can be used by the client
     * to query the duration and runtime error status. If the task has finished
     * before the call returns, an empty handle may be returned for the sync fence. If the
     * asynchronous task fails to launch, executeFenced must immediately return with
     * ErrorStatus::GENERAL_FAILURE, an empty handle for the sync fence, and nullptr
     * for the callback. The execution must wait for all the sync fences (if any) in waitFor
     * to be signaled before starting the actual execution.
     *
     * If any of the sync fences in waitFor changes to an error status after the executeFenced
     * call succeeds, the driver must immediately set the returned sync fence to an error status.
     *
     * When the asynchronous task has finished its execution, it must
     * immediately signal the sync fence created when dispatching. After
     * the sync fence is signaled, the task must not modify the content of
     * any data object referenced by 'request' (described by the
     * {@link @1.0::DataLocation} of a {@link @1.0::RequestArgument}).
     *
     * Any number of calls to the executeFenced, execute* and executeSynchronously*
     * functions, in any combination, may be made concurrently, even on the same
     * IPreparedModel object.
     *
     * @param request The input and output information on which the prepared
     *                model is to be executed.
     * @param waitFor A vector of sync fence file descriptors.
     *                Execution must not start until all sync fences have been signaled.
     * @param measure Specifies whether or not to measure duration of the execution.
     *                The duration runs from the time the driver sees the call
     *                to the executeFenced function to the time the sync fence is triggered.
     * @return status Error status of the call, must be:
     *                - NONE if task is successfully launched
     *                - DEVICE_UNAVAILABLE if driver is offline or busy
     *                - GENERAL_FAILURE if there is an unspecified error
     *                - INVALID_ARGUMENT if one of the input arguments is invalid, including
     *                  fences in error states.
     * @return syncFence The sync fence that will be triggered when the task is completed.
     *                   The sync fence will be set to an error state if a critical error,
     *                   e.g. hardware failure or kernel panic, occurs during execution.
     * @return callback The IFencedExecutionCallback can be used to query information like duration
     *                  and error status when the execution is completed.
     */
    executeFenced(Request request, vec<handle> waitFor, MeasureTiming measure)
        generates (ErrorStatus status, handle syncFence, IFencedExecutionCallback callback);
};