You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
refactor: removing the strict_types and max_batch_size apis
BREAKING CHANGE: This commit removes the strict types and max_batch_size apis. We are doing this because the functionality of these APIs in TRT is convoluted and likely to be ignored during building. A replacement for strict types with actual guarantees will be added at a later date.
Signed-off-by: Dheeraj Peri <[email protected]>
enabled_precision (Set(Union(torch.dtype, torch_tensorrt.dtype))): The set of datatypes that TensorRT can use when selecting kernels
277
267
refit (bool): Enable refitting
278
268
debug (bool): Enable debuggable engine
279
-
strict_types (bool): Kernels should strictly run in a particular operating precision. Enabled precision should only have one type in the set
280
269
capability (torch_tensorrt.EngineCapability): Restrict kernel selection to safe gpu kernels or safe dla kernels
281
270
num_min_timing_iters (int): Number of minimization timing iterations used to select kernels
282
271
num_avg_timing_iters (int): Number of averaging timing iterations used to select kernels
283
272
workspace_size (int): Maximum size of workspace given to TensorRT
284
-
max_batch_size (int): Maximum batch size (must be >= 1 to be set, 0 means not set)
285
273
truncate_long_and_double (bool): Truncate weights provided in int64 or double (float64) to int32 and float32
286
274
calibrator (Union(torch_tensorrt._C.IInt8Calibrator, tensorrt.IInt8Calibrator)): Calibrator object which will provide data to the PTQ system for INT8 Calibration
0 commit comments