
Implement MAML meta-opt #23

Merged · 18 commits · May 9, 2022

Conversation

@wj-Mcat wj-Mcat commented Apr 14, 2022

Description

Try to refactor the MAML meta-learning algorithm to make it more reusable in Paddle-based applications.

Consideration

In the MAML module, there are three things that differ from ordinary model-training code (sketched in the example after this list):

  • clone the module while retaining the computation graph
  • accumulate the gradient on the cloned model
  • backpropagate the gradient from the query-set data
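
For concreteness, here is a minimal sketch of the meta-step these three points describe, written against the public paddle API on a single hand-rolled linear layer; the shapes, inner learning rate, and random data are arbitrary placeholders, and this is not the code proposed in this PR:

import paddle
import paddle.nn.functional as F

# Illustrative single MAML meta-step on a hand-rolled linear classifier.
w = paddle.randn([784, 5])
b = paddle.zeros([5])
w.stop_gradient = False
b.stop_gradient = False
meta_opt = paddle.optimizer.Adam(learning_rate=1e-3, parameters=[w, b])
inner_lr = 0.4

support_x, support_y = paddle.randn([25, 784]), paddle.randint(0, 5, [25])
query_x, query_y = paddle.randn([75, 784]), paddle.randint(0, 5, [75])

# (1) keep the computation graph: gradients of the support loss are taken
#     with create_graph=True so the outer loss can differentiate through them
support_loss = F.cross_entropy(paddle.matmul(support_x, w) + b, support_y)
gw, gb = paddle.grad(support_loss, [w, b], create_graph=True)

# (2) "cloned" fast weights: the source weights after one inner SGD step
fw, fb = w - inner_lr * gw, b - inner_lr * gb

# (3) backpropagate the query-set loss through the fast weights back to the
#     source weights, then let the outer optimizer update them
query_loss = F.cross_entropy(paddle.matmul(query_x, fw) + fb, query_y)
query_loss.backward()
meta_opt.step()
meta_opt.clear_grad()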

Design

from abc import ABC, abstractmethod

from paddle import Tensor
from paddle.nn import Layer
from paddle.optimizer import Optimizer


class BaseLearner(ABC):
    """Abstract base learner class."""

    def __init__(self, module: Layer, optimizer: Optimizer) -> None:
        """The constructor of BaseLearner.

        Args:
            module (Layer): the model to be trained
            optimizer (Optimizer): the optimizer that updates the source model
        """
        super().__init__()
        self._source_module = module
        self.cloned_module = None
        self.optimizer = optimizer

    def new_cloned_model(self) -> Layer:
        """Get the cloned model and keep the computation graph.

        Returns:
            Layer: the cloned model
        """
        # clone_model is a project helper that copies the module while
        # retaining the computation graph between the source and the clone.
        self.cloned_module = clone_model(self._source_module)
        return self.cloned_module

    @abstractmethod
    def adapt(self, train_loss: Tensor) -> None:
        """Adapt the cloned model to the current training loss.

        Args:
            train_loss (Tensor): the current training loss
        """
        raise NotImplementedError

    @abstractmethod
    def step(self) -> None:
        """Run an optimizer step on the parameters of the source model."""
        raise NotImplementedError

    def clear_grad(self):
        """Clear the gradients tracked by the optimizer."""
        self.optimizer.clear_grad()
  • adapt: accumulate the gradient on the cloned model
  • new_cloned_model: clone the model and keep a reference to the clone
  • step: run an optimizer step on the parameters of the source model
  • clear_grad: clear the accumulated gradients (usage of these methods is sketched below)
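
A hedged usage sketch of how an outer loop might drive this interface; MAMLLearner, loss_fn, model, meta_optimizer, and task_loader are illustrative placeholders, not code from this PR:

# Hypothetical outer loop built on the BaseLearner interface above.
learner = MAMLLearner(module=model, optimizer=meta_optimizer)

for (support_x, support_y), (query_x, query_y) in task_loader:
    cloned = learner.new_cloned_model()        # clone and keep the computation graph
    support_loss = loss_fn(cloned(support_x), support_y)
    learner.adapt(support_loss)                # accumulate gradients on the clone
    query_loss = loss_fn(cloned(query_x), query_y)
    query_loss.backward()                      # backpropagate through the cloned graph
    learner.step()                             # optimizer step on the source model
    learner.clear_grad()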

wj-Mcat commented Apr 14, 2022

Trying to accomplish the task in #17.

wj-Mcat commented Apr 19, 2022

Changes in Learner Structure

class BaseLearner(Layer):
    """Abstract Base Learner Class"""
    def __init__(self, module: Layer) -> None:
        """The constructor of BaseLearner

        Args:
            module (Layer): the model to be trained
        """
        super().__init__()
        self.module = module

    @abstractmethod
    def adapt(self, loss: Tensor) -> None:
        """Adapt the model to the current training loss

        Args:
            loss (Tensor): the current training loss
        """
        raise NotImplementedError

-   def new_cloned_model(self) -> Layer:
+   def clone(self: Type[Learner]) -> Learner:
        """Create the cloned module and keep the computation graph.

        Args:
            self (Type[Learner]): the sub-learner

        Returns:
            Learner: the cloned model
        """
        raise NotImplementedError

    def forward(self, *args, **kwargs):
        return self.module(*args, **kwargs)

-   @abstractmethod
-   def step(self) -> None:
-        """Perform a step of training
-
-        Args: 
-           loss (float): _description_
-
-        Raises:
-            NotImplementedError: _description_
-        """
-       raise NotImplementedError
-
-   def clear_grad(self):
-        """clear the gradient in the computation graph
-        """
-        self.optimizer.clear_grad()

As the code above shows, there are two main changes (an outer-loop sketch follows this list):

  • rename new_cloned_model to clone, so the meta-learning algorithm can be reused.
  • remove the optimizer from the learner; the outer loop handles it instead.
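
To illustrate the second point, here is a hedged sketch of what the outer loop might look like once the optimizer is moved out of the learner; MAMLLearner, model, loss_fn, and task_loader are illustrative placeholders, not code from this PR:

import paddle

# Hypothetical outer loop under the revised design: the learner only exposes
# clone()/adapt()/forward(), while the optimizer lives in the outer loop.
meta_learner = MAMLLearner(module=model)
meta_opt = paddle.optimizer.Adam(learning_rate=1e-3,
                                 parameters=meta_learner.parameters())

for (support_x, support_y), (query_x, query_y) in task_loader:
    fast_learner = meta_learner.clone()        # per-task copy, graph preserved
    support_loss = loss_fn(fast_learner(support_x), support_y)
    fast_learner.adapt(support_loss)           # inner-loop update on the clone
    query_loss = loss_fn(fast_learner(query_x), query_y)
    query_loss.backward()                      # gradients flow back to the source module
    meta_opt.step()                            # the outer loop now owns the optimizer
    meta_opt.clear_grad()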

@tata1661 (Owner) left a comment

@wj-Mcat Can you provide a detailed README.md to compare your empirical results with existing ones provided by PaddleFSL? You can put it in examples/optim. Thanks for the contribution!

wj-Mcat commented Apr 22, 2022

Sorry, I took a few days off. I will run the experiments and get the empirical results in the next few days.

wj-Mcat commented May 9, 2022

To verify the effectiveness of the optim method, I ran experiments with both the model-zoo implementation and the optim implementation. Below are my conclusions:

Omniglot - MAML

  • model-zoo: [result screenshot]
  • optim: [result screenshot]

Omniglot - ANIL

  • model-zoo: [result screenshot]
  • optim: [result screenshot]

MiniImagenet - ANIL

  • model-zoo: [result screenshot]
  • optim: [result screenshot]

CIFAR-FS - MAML

  • model-zoo: [result screenshot]
  • optim: [result screenshot]

wj-Mcat commented May 9, 2022

Metric Overview

Dataset      | Algo | Model zoo (first order) | Optim (first order)
Omniglot     | MAML | 97.25 ± 1.7             | 97.07 ± 2.4
Omniglot     | ANIL | 93.62 ± 2.08            | 94.80 ± 3.7
MiniImageNet | ANIL | 52.56 ± 3.5             | 57.50 ± 3.2
CIFAR-FS     | MAML | 46.88 ± 3.4             | 49.44 ± 4.7

@tata1661 tata1661 merged commit 4e0dae5 into tata1661:master May 9, 2022
wj-Mcat commented May 9, 2022

Thanks for merging. I will try to fix #28 in another PR.
