I am trying to implement FaceNet in Keras with the TensorFlow backend and I have some problems with the triplet loss.
I call the fit function with 3*n number of images and t
Are you constraining your embeddings to "be on a d-dimensional hypersphere"? Try running tf.nn.l2_normalize on your embeddings right after they come out of the CNN.
The problem could be that the embeddings are sort of being smart-alecs. One easy way to reduce the loss is to just set everything to zero. l2_normalize
forces them to be unit length.
It looks like you'll want to add the normalization right after the last average pool.
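The effect can be checked in plain NumPy (a minimal sketch; the helper below reimplements what tf.nn.l2_normalize does along a given axis, dividing each embedding vector by its Euclidean norm):

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-12):
    # Divide each embedding by its Euclidean norm, as tf.nn.l2_normalize does.
    norm = np.sqrt(np.sum(x ** 2, axis=axis, keepdims=True))
    return x / np.maximum(norm, eps)

embeddings = np.array([[3.0, 4.0],
                       [0.6, 0.8]])
unit = l2_normalize(embeddings)
print(np.linalg.norm(unit, axis=-1))  # [1. 1.] -> every embedding is unit length
```

Once the embeddings live on the unit hypersphere, collapsing them all to zero is no longer a way to cheat the loss.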
What could have happened, other than the learning rate simply being too high, is that an unstable triplet selection strategy was used. If, for example, you only use 'hard triplets' (triplets where the anchor-negative distance is smaller than the anchor-positive distance), your network weights might collapse all embeddings to a single point, making the loss always equal to the margin (your _alpha), because all embedding distances are zero.
This can be fixed by mixing in other kinds of triplets as well, like 'semi-hard triplets', where the anchor-positive distance is smaller than the anchor-negative distance, but the gap between the two is still smaller than the margin. Checking which kind of triplets you are actually feeding the network is worth doing. It is explained in more detail in this blog post: https://omoindrot.github.io/triplet-loss
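The collapse described above, and the semi-hard condition, can both be demonstrated numerically (a sketch with made-up embeddings; the distances are illustrative, not from any real network):

```python
import numpy as np

def triplet_loss(a, p, n, margin=0.2):
    # Standard triplet loss: max(||a-p||^2 - ||a-n||^2 + margin, 0)
    d_ap = np.sum((a - p) ** 2)
    d_an = np.sum((a - n) ** 2)
    return max(d_ap - d_an + margin, 0.0)

# Collapsed network: every embedding is the same point,
# so the loss gets stuck exactly at the margin.
e = np.zeros(128)
print(triplet_loss(e, e, e))  # 0.2

# Semi-hard triplet: d(a,p) < d(a,n) < d(a,p) + margin
a = np.array([0.0, 0.0])
p = np.array([0.3, 0.0])  # d_ap = 0.09
n = np.array([0.4, 0.0])  # d_an = 0.16, still within margin of d_ap
d_ap = np.sum((a - p) ** 2)
d_an = np.sum((a - n) ** 2)
print(d_ap < d_an < d_ap + 0.2)  # True -> this triplet is semi-hard
```

Semi-hard triplets still produce a positive loss, but they cannot be satisfied by shrinking all distances to zero, which is what stabilizes training.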
I have met the same problem, and I did some research on it. I think it is because triplet loss needs multiple inputs, which may cause the network to generate outputs like that. I haven't fixed the problem yet, but you can check the Keras issue page for more details: https://github.com/keras-team/keras/issues/9498.
In that issue, I implemented a fake dataset and a fake triplet loss to reproduce the problem; after I changed the input structure of the network, the loss became normal.
The loss function in TensorFlow requires a 1-D tensor of integer class labels. I think you are passing a 2-D matrix, i.e. one-hot encodings.
Try this:
import keras.backend as K
from tensorflow.contrib.losses.metric_learning import triplet_semihard_loss

def loss(y_true, y_pred):
    # Convert one-hot labels back to integer class indices.
    y_true = K.argmax(y_true, axis=-1)
    return triplet_semihard_loss(labels=y_true, embeddings=y_pred, margin=1.)