Term: Spring 2017
-
Team #4
-
Team members
- team member 1 (
Qingyuan Zhang
) - team member 2 (
Zhilin Fan
) - team member 3 (
Yuxin Zhu
) - team member 4 (
Vanessa Huang
) - team member 5 (
Zishuo Li
)
- team member 1 (
-
Project summary: In this project, we utilized deep learning packages in matlab to do feature extraction from given 2000 gray scale images of poodles and fried chicken. Then we applied machine learning tools to help build good classifier and make prediction on new images. We used the given 5000-dimensional SIFT features at our first attempt, but we only got 80% accuracy rate and the execution time was not satisfying. After some attempts, we used LBP(Local Binary Patterns) to extract features. The processing time of 2000 images is 210s and the dimension of feature matrix is only 2000*59. We used GBM, BP Neural Networks, Random Forest, SVM and Logistic Regression for advanced models.
Here is a summary of our models.
We chose Majority Vote(BP Netural Networks, SVM, Logistic Regression) as our final model. Since training time of each model is very short, time won't be a problem for majority vote. Although we sacrifice little accuracy, We can get a more robust model.
In general, the error rate of our Baseline model is 27%, and that of our final model is 13%. And thus, we have successfully improved our model without scarificing computational efficiency.
Reproducibility: To reproduce our work, set your working directory onto doc
folder, then run the main.rmd
file.
Contribution statement: (default) All team members contributed equally in all stages of this project. All team members approve our work presented in this GitHub repository including this contributions statement.
Following suggestions by RICH FITZJOHN (@richfitz). This folder is orgarnized as follows.
proj/
├── lib/
├── data/
├── doc/
├── figs/
└── output/
Please see each subfolder for a README file.