Open Source Software Used in Computer Vision

1 Computer Vision
2 Features
3 Machine Learning
4 Data Mining
- 4.1 Frequent Pattern Mining

1 Computer Vision

OpenCV is the most widely used open source library in computer vision. It implements nearly all the classic computer vision algorithms. However, the implementations in OpenCV are highly optimized and, as a result, fairly complex, so modifying the OpenCV code for research purposes is not recommended.

1.1 Object Detection

Deformable part model: a state-of-the-art general-purpose detector. [http://people.cs.uchicago.edu/~rbg/latent/]
Real-time object instance detection based on gradient response maps. [http://campar.in.tum.de/personal/hinterst/index/index.html]

1.1.1 Face Detection

OpenCV has an implementation based on boosted Haar features.
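
To make the term "Haar feature" concrete, the sketch below computes one two-rectangle Haar-like feature with an integral image (summed-area table). It is a minimal illustration and not OpenCV's code; the function names and the toy 4x4 image are assumptions made for this example.

```python
def integral_image(img):
    """Return a summed-area table with one extra row and column of zeros."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, x, y, w, h):
    """Sum of the pixels in the rectangle with top-left (x, y), width w, height h."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def haar_two_rect_horizontal(ii, x, y, w, h):
    """Two-rectangle feature: left-half sum minus right-half sum (w must be even)."""
    half = w // 2
    return rect_sum(ii, x, y, half, h) - rect_sum(ii, x + half, y, half, h)


# Toy image (hypothetical data): bright left half, dark right half.
image = [
    [9, 9, 1, 1],
    [9, 9, 1, 1],
    [9, 9, 1, 1],
    [9, 9, 1, 1],
]
ii = integral_image(image)
feature = haar_two_rect_horizontal(ii, 0, 0, 4, 4)
print(feature)
```

Each rectangle sum costs four table lookups regardless of its size, which is what makes evaluating many such features per window cheap. A boosted detector then combines many thresholded features of this kind into a classifier.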

1.1.2 Pedestrian Detection

1.2 Articulated Human Part Detection

A variant of the deformable part model by Yi Yang at UCI. [http://www.ics.uci.edu/~yyang8/research/pose/]

1.3 Image Recognition

1.3.1 Scene Recognition

ObjectBank [http://vision.stanford.edu/projects/objectbank/]

1.3.2 General Image Recognition

LLC implements the spatial pyramid bag-of-words approach to image recognition. [http://www.ifp.illinois.edu/~jyang29/LLC.htm]

1.4 Optical Flow

2 Features

2.1 Interest Point Detector

STIP: an interest point detector for videos. The software is distributed only as binaries. It produces fairly sparse interest points, so it is suited to recognizing actions with large motion. The software also implements HOG and HOF descriptors. [http://www.di.ens.fr/~laptev/download.html]

2.2 Feature Coding

2.2.1 Sparse Coding

SPAMS (SPArse Modeling Software) is extremely fast. [http://spams-devel.gforge.inria.fr/]

2.2.2 LLC Coding

LLC coding uses a nearest-neighbor heuristic to run faster than sparse coding.

LLC toolbox: [http://www.ifp.illinois.edu/~jyang29/LLC.htm]
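
The nearest-neighbor idea can be sketched as follows: instead of optimizing over the whole codebook, only the k codebook atoms closest to the descriptor are used, and a small constrained least-squares problem gives their weights. This is a minimal pure-Python sketch of that approximation and not the toolbox code; `llc_code`, `solve_linear`, the `reg` value, and the toy codebook are assumptions made for this example.

```python
def solve_linear(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def llc_code(x, codebook, k=2, reg=1e-4):
    """Approximate LLC code of descriptor x over its k nearest codebook atoms."""
    dist = [sum((xi - bi) ** 2 for xi, bi in zip(x, atom)) for atom in codebook]
    nearest = sorted(range(len(codebook)), key=lambda i: dist[i])[:k]
    # Shift the selected atoms so that x becomes the origin.
    z = [[bi - xi for bi, xi in zip(codebook[i], x)] for i in nearest]
    # Gram matrix of the shifted atoms, regularized for numerical stability.
    cov = [[sum(p * q for p, q in zip(zi, zj)) for zj in z] for zi in z]
    trace = sum(cov[i][i] for i in range(k))
    ridge = reg * trace if trace > 0 else reg
    for i in range(k):
        cov[i][i] += ridge
    # Solve for the weights, then rescale so that they sum to one.
    w = solve_linear(cov, [1.0] * k)
    total = sum(w)
    code = [0.0] * len(codebook)
    for i, wi in zip(nearest, w):
        code[i] = wi / total
    return code


# Toy codebook (hypothetical data): the descriptor lies between the first two atoms.
codebook = [[0.0, 0.0], [2.0, 0.0], [10.0, 10.0]]
code = llc_code([1.0, 0.0], codebook, k=2)
print(code)
```

The resulting code is zero for every atom outside the k nearest neighbors, so the cost per descriptor is a nearest-neighbor search plus a k-by-k linear solve rather than an iterative sparse optimization.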

2.2.3 Binary Coding

BRIEF, D-BRIEF: binary descriptors [http://cvlab.epfl.ch/research/detect/dbrief/] [http://cvlab.epfl.ch/software/brief/]

2.3 Local Features

2.3.1 SIFT

2.3.2 HOG

2.3.3 HOF

2.3.4 HOG-3D

The gradient used by HOG-3D is the average gradient over a spatio-temporal cube in a video.

[http://lear.inrialpes.fr/people/klaeser/research_hog3d]
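
As an illustration of what "average gradient over a cube" means, the sketch below averages central-difference gradients (in x, y, and t) over a small cube of a video stored as nested lists. It is a naive loop written for clarity and is not taken from the HOG-3D software; the function name, the `video[t][y][x]` layout, and the synthetic clip are assumptions made for this example.

```python
def mean_gradient(video, t0, y0, x0, size):
    """Average (gx, gy, gt) over the cube of side `size` starting at (t0, y0, x0).

    Central differences are used, so the cube must lie at least one voxel
    inside the video boundary on every side.
    """
    sx = sy = st = 0.0
    for t in range(t0, t0 + size):
        for y in range(y0, y0 + size):
            for x in range(x0, x0 + size):
                sx += (video[t][y][x + 1] - video[t][y][x - 1]) / 2.0
                sy += (video[t][y + 1][x] - video[t][y - 1][x]) / 2.0
                st += (video[t + 1][y][x] - video[t - 1][y][x]) / 2.0
    n = size ** 3
    return sx / n, sy / n, st / n


# Synthetic 4x4x4 clip (hypothetical data) whose intensity is 2*x + 3*t.
video = [[[2 * x + 3 * t for x in range(4)] for y in range(4)] for t in range(4)]
g = mean_gradient(video, 1, 1, 1, 2)
print(g)
```

For this synthetic clip the intensity rises by 2 per pixel in x and by 3 per frame in t, and the averaged gradient recovers exactly those slopes. A descriptor would then quantize the direction of such mean gradients into orientation bins.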

2.3.5 Spatio-temporal Orientation

Spatio-temporal orientation has been shown to achieve excellent performance.

It is implemented in the ActionBank software.

3 Machine Learning

Shogun is a large machine learning toolbox that implements many machine learning algorithms.

3.1 Clustering

3.1.1 K-Means

3.2 Classification

3.2.1 Boosting

3.2.2 Multiple Kernel Learning

Multiple kernel learning is a feature weighting/selection method.

Shogun has a good implementation.
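
The weighting/selection view of multiple kernel learning can be sketched as follows: each feature type gets its own kernel, and the classifier uses a weighted sum of those kernels, so a weight of zero removes a feature type altogether. The sketch only shows the kernel combination step with fixed weights; learning the weights is the actual job of a multiple kernel learning solver and is not shown. It does not use the Shogun API, and the function names and toy features are assumptions made for this example.

```python
def linear_kernel(a, b):
    """Dot product of two feature vectors."""
    return sum(p * q for p, q in zip(a, b))


def combined_kernel(samples_per_feature, weights):
    """Gram matrix K = sum_m weights[m] * K_m, with one linear kernel per feature type."""
    n = len(samples_per_feature[0])
    gram = [[0.0] * n for _ in range(n)]
    for feats, beta in zip(samples_per_feature, weights):
        for i in range(n):
            for j in range(n):
                gram[i][j] += beta * linear_kernel(feats[i], feats[j])
    return gram


# Two samples described by two hypothetical feature types.
color = [[1.0, 0.0], [0.0, 1.0]]
shape = [[2.0], [3.0]]

# A zero weight selects the color feature only.
gram_color_only = combined_kernel([color, shape], [1.0, 0.0])
# Equal weights blend both feature types.
gram_mixed = combined_kernel([color, shape], [0.5, 0.5])
print(gram_color_only)
print(gram_mixed)
```

The combined Gram matrix can be passed to any kernel classifier that accepts a precomputed kernel.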

3.3 Deep Learning

4 Data Mining

4.1 Frequent Pattern Mining

Mafia: Maximal Frequent Pattern Mining [http://himalaya-tools.sourceforge.net/Mafia/]
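
To show what "maximal frequent pattern" means, the brute-force sketch below lists the frequent itemsets of a tiny transaction database and keeps only those with no frequent proper superset. It is meant as a definition in code, not as a description of how Mafia searches; the function names and the toy transactions are assumptions made for this example, and the exhaustive enumeration only scales to a handful of items.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Return every itemset contained in at least min_support transactions."""
    items = sorted(set().union(*transactions))
    frequent = []
    for size in range(1, len(items) + 1):
        found = False
        for candidate in combinations(items, size):
            cset = frozenset(candidate)
            if sum(cset <= t for t in transactions) >= min_support:
                frequent.append(cset)
                found = True
        if not found:
            break  # no frequent set of this size, so none larger either
    return frequent


def maximal_frequent_itemsets(transactions, min_support):
    """Keep only the frequent itemsets that have no frequent proper superset."""
    frequent = frequent_itemsets(transactions, min_support)
    return [s for s in frequent if not any(s < other for other in frequent)]


# Toy transaction database (hypothetical data).
transactions = [
    frozenset({"a", "b", "c"}),
    frozenset({"a", "b"}),
    frozenset({"a", "c"}),
    frozenset({"b", "d"}),
]
maximal = maximal_frequent_itemsets(transactions, min_support=2)
print(maximal)
```

With a minimum support of 2, the frequent itemsets here are {a}, {b}, {c}, {a, b}, and {a, c}, and only {a, b} and {a, c} are maximal. Reporting just the maximal sets gives a compact summary, since every subset of a frequent itemset is itself frequent.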

Author: Jiang Wang, Ph.D. Candidate, Northwestern University

Date: 2013-02-06 13:26:13 CST
