=====Reading List=====
-----------------------

===Hierarchical image embedding===
  * Barz, Björn, and Joachim Denzler. "[[https://arxiv.org/pdf/1809.09924.pdf | Hierarchy-based image embeddings for semantic image retrieval.]]" In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 638-647. IEEE, 2019.
  * Verma, Nakul, Dhruv Mahajan, Sundararajan Sellamanickam, and Vinod Nair. "[[http://www.cs.toronto.edu/~vnair/cvpr12.pdf | Learning hierarchical similarity metrics.]]" In 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2280-2287. IEEE, 2012.
  * Chen, Tianshui, et al. "[[https://arxiv.org/pdf/1808.04505.pdf | Fine-grained representation learning and recognition by exploiting hierarchical semantic embedding.]]" arXiv preprint arXiv:1808.04505 (2018).

===Other image embedding works===
  * Oh Song, Hyun, Yu Xiang, Stefanie Jegelka, and Silvio Savarese. "[[https://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Song_Deep_Metric_Learning_CVPR_2016_paper.pdf | Deep metric learning via lifted structured feature embedding.]]" In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4004-4012. 2016.
  * Nagarajan, Tushar, and Kristen Grauman. "[[http://www.cs.utexas.edu/~grauman/papers/attributes-operators-eccv2018.pdf | Attributes as operators: factorizing unseen attribute-object compositions.]]" In Proceedings of the European Conference on Computer Vision (ECCV), pp. 169-185. 2018.

===Text embedding===
  * Dyer, Chris. "[[https://arxiv.org/pdf/1410.8251.pdf | Notes on noise contrastive estimation and negative sampling.]]" arXiv preprint arXiv:1410.8251 (2014). //Critique of negative sampling//
  * Goldberg, Yoav, and Omer Levy. "[[https://arxiv.org/pdf/1402.3722.pdf | word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method.]]" arXiv preprint arXiv:1402.3722 (2014). //Justification for word2vec and negative sampling//
  * Mnih, Andriy, and Yee Whye Teh. "[[https://arxiv.org/pdf/1206.6426.pdf | A fast and simple algorithm for training neural probabilistic language models.]]" arXiv preprint arXiv:1206.6426 (2012). //On noise contrastive estimation (NCE)//

===Hierarchical clustering===
  * Dasgupta, Sanjoy. "[[https://arxiv.org/pdf/1510.05043.pdf | A cost function for similarity-based hierarchical clustering.]]" arXiv preprint arXiv:1510.05043 (2015).
  * Heller, Katherine A., and Zoubin Ghahramani. "[[http://mlg.eng.cam.ac.uk/zoubin/papers/icml05heller.pdf | Bayesian hierarchical clustering.]]" In Proceedings of the 22nd International Conference on Machine Learning, pp. 297-304. ACM, 2005.