Issues: robertknight/rten

Issues list

8-bit quantization MVP
#347 opened Sep 6, 2024 by robertknight
6 of 10 tasks
Adjust default thread count on Apple Silicon systems (label: performance)
#342 opened Sep 2, 2024 by robertknight
Align ReduceMin / ReduceMax etc. handling of empty tensors with spec (label: Spec compliance)
#341 opened Sep 1, 2024 by robertknight
Prepack weights when model is loaded (label: performance)
#214 opened May 27, 2024 by robertknight
Make unary ops more efficient with non-contiguous inputs (label: performance)
#192 opened May 20, 2024 by robertknight
1 of 2 tasks
Run tests under AddressSanitizer (and possibly other sanitizers) (label: qa)
#151 opened May 5, 2024 by robertknight
Validate operator input counts (label: tooling)
#133 opened Apr 29, 2024 by robertknight
Enable re-using pool across graph executions (label: performance)
#122 opened Apr 26, 2024 by robertknight
Make execution planner smarter to enable running more operators in-place (label: performance)
#98 opened Apr 16, 2024 by robertknight
Run tests under WebAssembly in CI (labels: qa, WebAssembly)
#93 opened Apr 14, 2024 by robertknight
Document rten CLI tool (label: documentation)
#52 opened Feb 8, 2024 by robertknight
Convert quantized models
#42 opened Jan 20, 2024 by igor-yusupov
Apple AMX support (label: performance)
#18 opened Dec 31, 2023 by robertknight
Include a larger AVX512 GEMM kernel for (server) CPUs with 2 FMA units (label: performance)
#17 opened Dec 30, 2023 by robertknight
Support for missing ONNX operators
#14 opened Sep 17, 2023 by robertknight 100+
Consider coalescing dimensions in indexed iteration (label: performance)
#11 opened Feb 10, 2023 by robertknight