Unsupervised deep learning models used in computer vision by eneismijmich

deeplearning · @eneismijmich · May 21 '17

$35.78

Unsupervised deep learning models used in computer vision

<h3>Introduction</h3>

After a break, I wanted to post about this emerging field of unsupervised deep learning, since it is gaining momentum and achieving good results.

The idea of this post is to give introduction and overview of the most used models, tools and useful links to kickstart your dive into an interesting part of computer vision. Unsupervised models are still being heavily researched unlike supervised model such as convolutional neural networks. In the computer vision, there are 3 main applications of these models: dimensionality reduction, clustering similar images[+image retrieval/search] and generating images.

Dimensionality reduction is used to plot high dimensional data and find some insights into data. Images can be visualized in 3D space using t-sne dimensionality reduction method and tensboard projector from tensorflow.

![](https://steemitimages.com/DQmUsFe2ccwb3VwKTa3YTW45k8LhBF7jurqizjHK6W9J7sS/image.png)
**image taken from link http://projector.tensorflow.org/**

As for clustering, grouping not labeled data is a very interesting task because a lot of the data on the web is not labeled. For instance training model on images from video. People have to grab each image and manually label them with class or caption. This is not scalable so here unsupervised models can intervene.

![](https://steemitimages.com/DQmTK56MrU9MENRfdge6745iCYWXnCgjkFvmh5eF6JN7JBb/image.png)
**image link https://indico.io/blog/visualizing-with-t-sne/**

Generative models became popular after the paper Generative Adversarial Networks, 2014. from Ian Goodfellow, Yoshua Bengio and few other researchers was published. Many researchers shifted their focus to combining generative models to achieve good quality images generated from learned distributions. In the image below we can see how the generated image improves in quality training a GAN model.

![](https://steemitimages.com/DQmdnQhB8mDeb2f34yn7VhUcd8gEAa6W58KBLL9Ac45xSnQ/image.png)
**image taken from https://github.com/artcg/BEGAN**

To simplify idea about unsupervised models, their goal is to extract good features that will represent a high dimensional image in lower dimensional space without having labels. Here we will present some of the most used models, auto-encoders, and generative adversarial model.

<h3>Autoencoders</h3>

An autoencoder is a neural network that is not trained classifying images into class and minimizing the error function. But it is trained to reconstruct the input image from hidden layer h. Internally, it has a hidden layer h that describes a code used to represent the input. Let’s see the image [x] below, input image enters the network, goes through layers and is being coded into hidden layer h.

![](https://steemitimages.com/DQmf513t2V9qnFZ7jKv2FFhhhTygGu8bhrvwhLsgZ4yWsg6/image.png)
**image taken from https://www.slideshare.net/TJTorres1/deep-style-using-variational-autoencoders-for-image-generation**

This first part of the network is called encoder [h=f(x)]. After that, we reconstruct the image from hidden layer using the second part of the network called decoder [x’=g(h)]. The learning process is simply calculating the difference between the input image and output image. As we minimize that error our autoencoder network learns to decode high dimensional image into a good representation of the image in lower dimensional space.

Advantages of autoencoder are that it is a simple technique, reconstructing the input, layers can be stacked into stacked auto encoder and has its intuitive based on neuroscience research. But at the moment performance can’t match with supervised learning models and from some image datasets reconstruction of the input is not an ideal metric for learning a general purpose and informational representations.

Below are some links to good implementations to check out:

* https://github.com/cmgreen210/TensorFlowDeepAutoencoder
* https://github.com/musyoku/adversarial-autoencoder

<h3>Generative Adversarial models</h3>

The idea behind generative adversarial model is to have two different smaller neural network models competing. One of them called generator which takes noise as input and generates samples. The other one called discriminator, receives samples from both the generator model and the real image dataset samples. The discriminator has a goal to distinguish between generated and real samples.

![](https://steemitimages.com/DQmYjxk336A7mSmahNuwVUnY6or5GwmBtEJAnjVi6e72Wd9/image.png)
**image taken from https://wiki.tum.de/pages/viewpage.action?pageId=23562510**

These two networks actually have different (adversarial) roles in this continuous game. The generator is learning to produce more realistic samples to trick discriminator while discriminator becomes better and better distinguishing generated data from real. Networks are trained simultaneously and end up generating high quality images. There are many different implementations using different models, loss functions, you find a curated list of them in this link:

https://github.com/hindupuravinash/the-gan-zoo/blob/master/README.md

Below are some links to good implementations to check out:

* https://github.com/pytorch/examples/tree/master/dcgan
* https://github.com/artcg/BEGAN
* https://github.com/musyoku/wasserstein-gan

## Most popular tools and libraries used in the field

* Tensorflow by Google, it is most used at the moment with huge community. There are good tutorials on https://www.tensorflow.org/tutorials/
* MXNET adapted by Amazon. They have huge list of models implemented in their github repository
* Torch used by Facebook. Torch is mostly used with Lua language but there is also python version PyTorch.

## Useful links and materials

* Most used book in the field
http://www.deeplearningbook.org/
* Best research paper search website [Machine Learning, Deep Learning, Computer Vision]
http://www.arxiv-sanity.com/
* There are good discussions on Reddit with cited researchers adding to discussions
https://www.reddit.com/r/MachineLearning/
https://www.reddit.com/r/deeplearning/
* Search the google using [github + model] name you want to learn about, because there are plenty of implementations available to learn from

Hope you like this intro to unsupervised part of the computer vision.
Happy exploring.

👍 arama, trafalgar, engagement, infovore, slowwalker, jackkang, thecyclist, dimimp, inchonbitcoin, steemservices, b0y2k, aomura, atomrigs, sephiroth, richman, bue, booja, tommycoin, themagus, yougotflagged, opheliafu, theyeti, jonathanyoung, igster, thebotkiller, rznag, vandal, andrewawerdna, mallorca, trogdor, grey580, berniesanders, bue-witness, ullikume, danknugs, boy, asim, trans-juanmi, cryptofunk, mini, michaellamden68, timbot606, bitland, daniel.pan, illbeyourfriend, fleur, grande-fazial, eneismijmich, siniceku, healthcare, fernandam, binoddahal, bunny, kiambi, noeva, crawfish37, craigslist, psych101, elitewizard407, carl760, lifeisamazing, helen.tan, juliosalas, elevator09, and 17 others
👎 casido, torontocul, trucmuche, gladiatorwork, poritoza, astalavasti, trucmipo, cleversteem, vladimirtopiev

`post_id`	2,915,569
`author`	eneismijmich
`permlink`	unsupervised-deep-learning-models-used-in-computer-vision
`category`	deeplearning
`json_metadata`	"{"app": "steemit/0.1", "format": "markdown", "links": ["http://projector.tensorflow.org/", "https://indico.io/blog/visualizing-with-t-sne/", "https://github.com/artcg/BEGAN", "https://www.slideshare.net/TJTorres1/deep-style-using-variational-autoencoders-for-image-generation", "https://wiki.tum.de/pages/viewpage.action?pageId=23562510", "https://github.com/hindupuravinash/the-gan-zoo/blob/master/README.md"], "image": ["https://steemitimages.com/DQmUsFe2ccwb3VwKTa3YTW45k8LhBF7jurqizjHK6W9J7sS/image.png"], "tags": ["deeplearning", "computervision", "autoencoders", "gan", "unsupervised"]}"
`created`	2017-05-21 10:37:27
`last_update`	2017-05-21 10:37:27
`depth`	0
`children`	6
`net_rshares`	7,706,354,710,221
`last_payout`	2017-05-28 10:37:27
`cashout_time`	1969-12-31 23:59:59
`total_payout_value`	30.819 SBD
`curator_payout_value`	4.964 SBD
`pending_payout_value`	0.000 SBD
`promoted`	0.000 SBD
`body_length`	6,333
`author_reputation`	94,044,485,172,635
`root_title`	"Unsupervised deep learning models used in computer vision"
`beneficiaries`	`[]`
`max_accepted_payout`	1,000,000.000 SBD
`percent_steem_dollars`	10,000
`author_curate_reward`	""

properties (23)vote details (90)

voter	rshares	pct
berniesanders	9,066,105,659	10%
boy	6,454,506,974	100%
bue-witness	7,870,181,244	100%
bunny	1,143,703,974	100%
bue	129,660,840,981	100%
danknugs	6,892,742,211	10%
steemservices	234,779,949,689	10%
mini	3,449,826,847	100%
moon	438,622,134	100%
b0y2k	219,329,819,131	100%
healthcare	1,285,645,461	100%
daniel.pan	2,033,921,261	100%
sativa	71,911,710	10%
indica	55,310,402	10%
helen.tan	587,988,855	100%
atomrigs	180,594,717,571	100%
dimimp	290,329,925,956	100%
richman	159,464,702,126	100%
cryptofunk	3,495,522,661	99%
applecrisp	239,814,371	100%
infovore	562,337,498,958	100%
trogdor	12,040,161,749	10%
grey580	9,147,148,580	100%
michaellamden68	3,337,863,275	100%
booja	110,252,644,497	100%
jackkang	444,118,356,112	51%
slowwalker	555,198,882,172	15%
asim	5,008,386,042	100%
bitland	2,129,601,264	100%
igster	24,840,002,112	100%
sephiroth	167,116,524,890	100%
opheliafu	32,117,226,801	63%
carlidos	107,729,517	100%
rznag	17,602,657,007	50%
ap2002	71,652,353	100%
craigslist	869,928,897	100%
ullikume	6,937,404,134	100%
elena000	207,054,252	100%
themagus	45,057,140,753	100%
tommycoin	59,255,857,986	100%
thebotkiller	19,091,123,038	10%
eneismijmich	1,432,940,288	100%
andrewawerdna	14,569,067,525	100%
mrsteemitbwhale	328,602,682	100%
inchonbitcoin	289,973,857,840	100%
thecyclist	366,595,431,186	10%
seckorama	55,363,944	100%
jonathanyoung	25,192,104,015	100%
mallorca	12,980,197,988	100%
arama	1,270,302,743,842	100%
cupang	72,319,227	100%
psych101	828,312,377	100%
timbot606	3,240,112,130	100%
lifeisamazing	742,410,618	100%
siniceku	1,361,943,891	100%
trans-juanmi	3,963,612,066	60%
countryfolk1	422,751,286	100%
illbeyourfriend	1,855,760,100	10%
engagement	801,256,599,792	10%
bottymcbotface	226,736,448	81%
juliosalas	569,627,334	60%
noeva	1,033,876,571	100%
eem	84,376,952	47%
theyeti	25,427,261,582	10%
yougotflagged	43,986,636,461	10%
fleur	1,566,963,996	80%
trafalgar	1,268,831,869,330	24%
crawfish37	929,194,179	100%
elevator09	471,573,637	100%
aomura	210,277,798,642	100%
binoddahal	1,169,578,277	100%
fernandam	1,228,124,075	100%
carl760	744,608,609	100%
vandal	17,132,333,769	100%
kiambi	1,075,748,597	100%
grande-fazial	1,515,369,352	100%
tremoulinas	0	100%
elitewizard407	818,298,006	100%
robik	0	100%
ahnassif	0	100%
shikibyakko	0	100%
vladimirtopiev	0	-100%
cleversteem	0	-100%
trucmipo	0	-100%
astalavasti	0	-100%
poritoza	0	-100%
gladiatorwork	0	-100%
trucmuche	0	-100%
torontocul	0	-100%
casido	0	-100%

@binoddahal · May 21 '17

Yes....u r right.....You showcasing us reality...

👍 binoddahal

properties (23)vote details (1)

voter	weight	wgt%	rshares	pct	time
binoddahal	0 B		1,145,709,333	100%

@sjovmaiin · May 21 '17

https://i.imgflip.com/pwi28.jpg

👍 sjovmaiin

`post_id`	2,915,978
`author`	sjovmaiin
`permlink`	re-eneismijmich-unsupervised-deep-learning-models-used-in-computer-vision-20170521t110651338z
`category`	deeplearning
`json_metadata`	"{"app": "steemit/0.1", "image": ["https://i.imgflip.com/pwi28.jpg"], "tags": ["deeplearning"]}"
`created`	2017-05-21 11:06:51
`last_update`	2017-05-21 11:06:51
`depth`	1
`children`	0
`net_rshares`	788,724,774
`last_payout`	2017-05-28 11:06:51
`cashout_time`	1969-12-31 23:59:59
`total_payout_value`	0.000 SBD
`curator_payout_value`	0.000 SBD
`pending_payout_value`	0.000 SBD
`promoted`	0.000 SBD
`body_length`	31
`author_reputation`	-71,706,009,704
`root_title`	"Unsupervised deep learning models used in computer vision"
`beneficiaries`	`[]`
`max_accepted_payout`	1,000,000.000 SBD
`percent_steem_dollars`	10,000
`author_curate_reward`	""

properties (23)vote details (1)

voter	weight	wgt%	rshares	pct	time
sjovmaiin	0 B		788,724,774	100%

@betacore · Jul 11 '17 (edited)

extremely well made post on a topic that needs more people learning about it. AI is possibly the most important technology ever created and yet most people have no clue about how far it can really go. If you're interested in learning even more, I recommend the Two Minute Papers channel on YouTube, it's a scholarly synopsis format of over 150 papers mostly related directly to AI. Give it a look sometime.

Thank you for posting.

👍 butterfly-effect

`post_id`	6,813,790
`author`	betacore
`permlink`	re-eneismijmich-unsupervised-deep-learning-models-used-in-computer-vision-20170711t032121329z
`category`	deeplearning
`json_metadata`	"{"app": "steemit/0.1", "tags": ["deeplearning"]}"
`created`	2017-07-11 03:21:18
`last_update`	2017-07-11 03:21:51
`depth`	1
`children`	0
`net_rshares`	2,876,588,828
`last_payout`	2017-07-18 03:21:18
`cashout_time`	1969-12-31 23:59:59
`total_payout_value`	0.000 SBD
`curator_payout_value`	0.000 SBD
`pending_payout_value`	0.000 SBD
`promoted`	0.000 SBD
`body_length`	430
`author_reputation`	41,900,791,057
`root_title`	"Unsupervised deep learning models used in computer vision"
`beneficiaries`	`[]`
`max_accepted_payout`	1,000,000.000 SBD
`percent_steem_dollars`	10,000
`author_curate_reward`	""

properties (23)vote details (1)

voter	weight	wgt%	rshares	pct	time
butterfly-effect	0 B		2,876,588,828	100%

@steemitboard · Jul 26 '17

Congratulations @eneismijmich! You have received a personal award!

[![](https://steemitimages.com/70x70/http://steemitboard.com/@eneismijmich/birthday1.png)](http://steemitboard.com/@eneismijmich) Happy Birthday - 1 Year on Steemit
Click on the badge to view your own Board of Honor on SteemitBoard.

For more information about this award, click [here](https://steemit.com/steemitboard/@steemitboard/steemitboard-update-8-happy-birthday)
> By upvoting this notification, you can help all Steemit users. Learn how [here](https://steemit.com/steemitboard/@steemitboard/http-i-cubeupload-com-7ciqeo-png)!

properties (22)

`post_id`	8,344,774
`author`	steemitboard
`permlink`	steemitboard-notify-eneismijmich-20170726t121129000z
`category`	deeplearning
`json_metadata`	"{"image": ["https://steemitboard.com/img/notifications.png"]}"
`created`	2017-07-26 12:11:27
`last_update`	2017-07-26 12:11:27
`depth`	1
`children`	0
`net_rshares`	0
`last_payout`	2017-08-02 12:11:27
`cashout_time`	1969-12-31 23:59:59
`total_payout_value`	0.000 SBD
`curator_payout_value`	0.000 SBD
`pending_payout_value`	0.000 SBD
`promoted`	0.000 SBD
`body_length`	602
`author_reputation`	38,705,954,145,809
`root_title`	"Unsupervised deep learning models used in computer vision"
`beneficiaries`	`[]`
`max_accepted_payout`	1,000,000.000 SBD
`percent_steem_dollars`	10,000

@steemitboard · Jul 26 '18

Congratulations @eneismijmich! You have received a personal award!

[![](https://steemitimages.com/70x70/http://steemitboard.com/@eneismijmich/birthday2.png)](http://steemitboard.com/@eneismijmich)  2 Years on Steemit
<sub>_Click on the badge to view your Board of Honor._</sub>


> Do you like [SteemitBoard's project](https://steemit.com/@steemitboard)? Then **[Vote for its witness](https://v2.steemconnect.com/sign/account-witness-vote?witness=steemitboard&approve=1)** and **get one more award**!

properties (22)

`post_id`	57,770,880
`author`	steemitboard
`permlink`	steemitboard-notify-eneismijmich-20180726t133014000z
`category`	deeplearning
`json_metadata`	{"image":["https:\/\/steemitboard.com\/img\/notify.png"]}
`created`	2018-07-26 13:30:12
`last_update`	2018-07-26 13:30:12
`depth`	1
`children`	0
`net_rshares`	0
`last_payout`	2018-08-02 13:30:12
`cashout_time`	1969-12-31 23:59:59
`total_payout_value`	0.000 SBD
`curator_payout_value`	0.000 SBD
`pending_payout_value`	0.000 SBD
`promoted`	0.000 SBD
`body_length`	501
`author_reputation`	38,705,954,145,809
`root_title`	"Unsupervised deep learning models used in computer vision"
`beneficiaries`	`[]`
`max_accepted_payout`	1,000,000.000 SBD
`percent_steem_dollars`	10,000

@steemitboard · Jul 26 '19

Congratulations @eneismijmich! You received a personal award!

<table><tr><td>https://steemitimages.com/70x70/http://steemitboard.com/@eneismijmich/birthday3.png</td><td>Happy Birthday! - You are on the Steem blockchain for 3 years!</td></tr></table>

<sub>_You can view [your badges on your Steem Board](https://steemitboard.com/@eneismijmich) and compare to others on the [Steem Ranking](https://steemitboard.com/ranking/index.php?name=eneismijmich)_</sub>


###### [Vote for @Steemitboard as a witness](https://v2.steemconnect.com/sign/account-witness-vote?witness=steemitboard&approve=1) to get one more award and increased upvotes!

properties (22)

`post_id`	78,368,508
`author`	steemitboard
`permlink`	steemitboard-notify-eneismijmich-20190726t124821000z
`category`	deeplearning
`json_metadata`	{"image":["https:\/\/steemitboard.com\/img\/notify.png"]}
`created`	2019-07-26 12:48:21
`last_update`	2019-07-26 12:48:21
`depth`	1
`children`	0
`net_rshares`	0
`last_payout`	2019-08-02 12:48:21
`cashout_time`	1969-12-31 23:59:59
`total_payout_value`	0.000 SBD
`curator_payout_value`	0.000 SBD
`pending_payout_value`	0.000 SBD
`promoted`	0.000 SBD
`body_length`	636
`author_reputation`	38,705,954,145,809
`root_title`	"Unsupervised deep learning models used in computer vision"
`beneficiaries`	`[]`
`max_accepted_payout`	1,000,000.000 SBD
`percent_steem_dollars`	10,000