WIP: Sparse coder #456

vene · Dec 6, 2011

This quick and dirty pull request adds an estimator (transformer), SparseCoder, that implements sparse coding against a fixed dictionary.

At the same time this will address the small inconsistencies and missing info in docs that came up.
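For readers unfamiliar with the idea, here is a minimal sketch of what "sparse coding against a fixed dictionary" means. It is not the SparseCoder from this pull request: the class name `ToySparseCoder`, the list-of-lists data layout, and the choice of a single greedy matching-pursuit step as the encoder are all assumptions made for illustration.

```python
import math


class ToySparseCoder:
    """Hypothetical stand-in, not the SparseCoder added by this PR."""

    def __init__(self, dictionary):
        # dictionary: list of atoms, each a list of floats with unit norm.
        self.dictionary = dictionary

    def fit(self, X, y=None):
        # Nothing to learn: the dictionary is fixed at construction time.
        return self

    def transform(self, X):
        # Encode each sample with one nonzero coefficient: the atom with
        # the largest absolute correlation (a single matching-pursuit step).
        codes = []
        for x in X:
            corr = [sum(a * b for a, b in zip(atom, x))
                    for atom in self.dictionary]
            best = max(range(len(corr)), key=lambda k: abs(corr[k]))
            code = [0.0] * len(self.dictionary)
            code[best] = corr[best]
            codes.append(code)
        return codes


s = 1.0 / math.sqrt(2.0)
atoms = [[1.0, 0.0], [0.0, 1.0], [s, s]]  # three unit-norm atoms in 2D
coder = ToySparseCoder(atoms).fit(None)
codes = coder.transform([[2.0, 2.0], [0.0, -3.0]])
```

The point of the sketch is the shape of the API: `fit` is a no-op that returns `self`, and all the work happens in `transform`.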

ogrisel · Dec 6, 2011

sklearn/decomposition/dict_learning.py

@@ -708,6 +708,77 @@ def transform(self, X, y=None):
        return code


+class SparseCoder(BaseDictionaryLearning):
+    """ Sparse coding


Please:

"""Sparse coding

PEP257 :)

GaelVaroquaux · Dec 7, 2011

With regard to precomputing wavelet dictionaries, I do not think that this is a good idea, because a wavelet transform can be implemented much faster than a dot product. In addition, it is an orthogonal basis, so sparse coding should preferably be done using soft thresholding. Finally, this would be fairly image- or signal-specific, and I don't like the idea of such application-specific code creeping into a general-purpose object.
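To illustrate the soft-thresholding remark: when the atoms form an orthonormal basis, the L1-penalized code (minimizing 0.5 * ||x - sum_k c_k d_k||^2 + alpha * ||c||_1) has a closed form, namely soft thresholding of the projections of x onto the atoms. The sketch below assumes a 2-point Haar basis and hypothetical helper names (`soft_threshold`, `encode_orthonormal`); it is not code from this pull request.

```python
import math


def soft_threshold(value, alpha):
    # Shrink the magnitude by alpha and clip at zero, keeping the sign.
    return math.copysign(max(abs(value) - alpha, 0.0), value)


def encode_orthonormal(x, basis, alpha):
    # For an orthonormal basis, the projections are plain dot products
    # and the penalized problem decouples per coefficient.
    projections = [sum(a * b for a, b in zip(atom, x)) for atom in basis]
    return [soft_threshold(p, alpha) for p in projections]


s = 1.0 / math.sqrt(2.0)
haar = [[s, s], [s, -s]]  # 2-point Haar basis, rows are orthonormal atoms
code = encode_orthonormal([3.0, 1.0], haar, alpha=1.5)
```

Here the second projection is smaller than alpha in magnitude, so its coefficient is set exactly to zero, while the first one is shrunk by alpha.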

GaelVaroquaux · Dec 7, 2011

sklearn/decomposition/dict_learning.py

@@ -708,8 +708,79 @@ def transform(self, X, y=None):
        return code


+class SparseCoder(BaseDictionaryLearning):


This object should be imported in the init of decomposition.

It should also be added in docs/modules/classes.rst

GaelVaroquaux · Dec 7, 2011

My biggest comment is that narrative documentation is missing. In
addition, an example would be useful. You could do an example with
wavelets, perhaps using a 1D signal to keep the computation
light.

Gael

amueller · Dec 16, 2011

sklearn/decomposition/dict_learning.py

@@ -129,7 +129,8 @@ def sparse_encode(X, Y, gram=None, cov=None, algorithm='lasso_lars',
                    max_iter=1000)
        for k in xrange(n_features):
            # A huge amount of time is spent in this loop. It needs to be
-            # tight.
+            # tight
+


pep8 whitespace ;)

amueller · Dec 16, 2011

I think it would be good if SparseCoder were referenced in the DictionaryLearning "see also" section.
BTW, I think the "see also" sections of both SparseCoder and DictionaryLearning should be cleaned up.
They contain a description that gets screwed up when generating the HTML docs.

vene · Dec 20, 2011

sklearn/decomposition/dict_learning.py


    if algorithm == 'lasso_lars':
        if alpha is None:
            alpha = 1.
+        alpha /= n_features  # account for scaling


Note that before, dict_learning and dict_learning_online would perform very differently on exactly the same data and for the same alpha. This was caused by sometimes forgetting to divide by n_features, and this change fixes it.
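A small sketch of why the division matters, under one assumption that is not spelled out in this thread: the lasso solver normalizes its data-fit term by the number of rows of the regression problem, which for sparse coding of a single signal is n_features. Under that assumption, passing alpha / n_features to the solver gives the same minimizer as the unnormalized sparse coding objective with alpha. All function names below are hypothetical.

```python
import math


def squared_error(x, atoms, code):
    # ||x - sum_k code[k] * atoms[k]||^2
    total = 0.0
    for i, xi in enumerate(x):
        approx = sum(c * atom[i] for c, atom in zip(code, atoms))
        total += (xi - approx) ** 2
    return total


def l1_norm(code):
    return sum(abs(c) for c in code)


def sparse_coding_objective(x, atoms, code, alpha):
    # Unnormalized form: 0.5 * ||residual||^2 + alpha * ||code||_1
    return 0.5 * squared_error(x, atoms, code) + alpha * l1_norm(code)


def normalized_lasso_objective(x, atoms, code, alpha):
    # Assumed solver convention: data-fit term divided by 2 * n_features.
    n_features = len(x)
    return (squared_error(x, atoms, code) / (2.0 * n_features)
            + alpha * l1_norm(code))


x = [1.0, 2.0, 3.0, 4.0]
atoms = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
code = [0.5, 1.5]
alpha = 2.0
n_features = len(x)

unscaled = sparse_coding_objective(x, atoms, code, alpha)
rescaled = n_features * normalized_lasso_objective(
    x, atoms, code, alpha / n_features)
```

The two objectives differ only by the constant factor n_features, so they share the same minimizer. Without the division, the penalty is effectively n_features times stronger than intended.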


fabianp · Dec 20, 2011

merged, thanks


vene added 4 commits December 6, 2011 11:56

Added SparseCoder estimator

5916017

Basic testing

52bbc80

DOC: add missing split_sign in docstrings

2e3316a

FIX: 10% of features should be at least 1

9656faf

ogrisel reviewed Dec 6, 2011

PEP257 :)

a7764a9

GaelVaroquaux reviewed Dec 7, 2011

vene added 9 commits December 7, 2011 10:31

restore typo

c1beb20

Added SparseCoder to init and class index

6b018a6

initial work on docs

fced530

implement noop fit in SparseCoder

20ba4fa

clean up test

826df8d

Fixed doc links

7fba5f5

Fixed lena in example

e9a6567

Merge branch 'master' into sparse-coder

b4fe032

cleaned up imports in test

3fac035

amueller reviewed Dec 16, 2011

vene added 9 commits December 19, 2011 10:34

Merge branch 'master' into sparse-coder

e22429f

FIX: objective functions in Lasso linear model docs

bb3e069

DOC: correct ordering of returns in dict_learning_online

325d5ae

DOC: clarified dimensions in _update_dict

05b4025

Fix the API and the scaling inside dict_learning

13a0507

DOC: specify scaling in linear_model.rst

ab7b857

work on failing tests

63f06a1

Merge branch 'master' into sparse-coder

5ef4a4d

skip tests that were wrongly passing before

5d06bee

vene added 6 commits December 19, 2011 19:03

Test for almost equal instead of equal in sparse_encode_error

697653e

FIX: slices generation

66796ca

Hide sparse_encode -- redundant

384f4b7

DOC: add optimization objective to lasso and enet docstrings

3aa5b12

DOC: make docstrings as good as I could

e9f432a

Warnings and deprecation

3a68704

vene reviewed Dec 20, 2011

vene added 7 commits December 20, 2011 11:06

DOC: better cross refs and docstrings

51f74dd

Adapted examples for alpha scaling

bb03e8a

Merge branch 'master' into sparse-coder

0572c56

PEP8

213e886

added sparse coding example

7fdd8df

s/threhold/threshold

6e7fd21

Merge branch 'master' of https://github.com/scikit-learn/scikit-learn …

a646e9a

…into sparse-coder Conflicts: sklearn/linear_model/least_angle.py

fabianp pushed a commit that referenced this pull request Dec 20, 2011

Merge pull request #456 from vene/sparse-coder

9a43b03

Sparse coder

fabianp merged commit 9a43b03 into scikit-learn:master Dec 20, 2011
