Skip to content

A toolkit for optimizing machine learning models for practical applications

License

Notifications You must be signed in to change notification settings

cisco-open/modelsmith

modelsmith logo

ModelSmith serves as a toolkit for compressing machine learning models, designed to enhance their speed, reduce their size, and improve energy efficiency. This toolbox addresses the demands for smaller model sizes, minimal memory usage, quicker inference times, energy-saving operations, and scenarios conforming to bandwidth limitations.

ModelSmith makes it ideal for deploying on a diverse array of devices and platforms, ensuring satisfactory performance. Its main features encompass post-training quantization, machine unlearning and a range of pruning algorithms, essential for optimizing models.

ModelSmith is developed by researchers and engineers working at Cisco Research.

Table of Contents

Getting Started

To get started with ModelSmith, follow these steps:

  1. Clone the repository to your local machine:

    git clone https://github.com/cisco-open/modelsmith.git

FAQ

1. How do I configure ModelSmith to run locally? Read here.

2. How do I configure ModelSmith to run on VM? Read here.

Features

  • Frontend Server: ModelSmith's frontend is built using Angular and is deployed as a production build to the 'frontend/server/modelsmith-build' folder. The frontend server, powered by Express.js, serves the web application in to users in optimized mode on port 4200. It provides a user interface for interacting with the application.

  • Backend Server: ModelSmith includes a backend server built using Node.js, which serves as the core of the application. The backend server handles various tasks such as model optimization, deployment, and management. It is accessible on port 3000. The backend offers APIs and services to support the frontend's functionality and communicates with a VM for running the python scripts.

  • ModelSmith Project - Python Files for Model Training: This feature includes a comprehensive collection of Python scripts designed for the training of machine learning models. These scripts are fine-tuned for high efficiency and can be executed on local machines with GPU support or on virtual machines (VMs), offering flexibility for users to train models using their own hardware or cloud resources.

Supported technologies

Contributions

ModelSmith strives to maintain a more up-to-date fork and encourages contributions from all individuals.

Maintainers and contributors

Yuguang Yao

Ioan Pop (John)

Yuzhang Shang

Yihua Zhang

Gaowen Liu

Contributing & Future Directions

We have several TODO's spread out throughout our files! We will continue working in these directions to improve ModelSmith, adjusting as needed based on our user's feedback

We intend to add functionalities to the log analysis for unsupervised log analysis.

If you wish to contribute or suggest any additional funtionalities, please check out Contributing Guidelines

License

Apache License 2.0.

About

A toolkit for optimizing machine learning models for practical applications

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •