1、Introduction
Solving the visual odometry (VO) problem with deep learning (DL): End-to-End VO with a Recurrent Convolutional Neural Network (RCNN).
2、Network structure
a、CNN based Feature Extraction
The paper evaluates on the KITTI dataset.
The CNN part has 9 convolutional layers; except for Conv6, each convolutional layer is followed by a ReLU layer, giving 17 layers in total.
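To make the layer stack concrete, here is a minimal PyTorch sketch of the feature extractor, assuming the FlowNet-style kernel sizes, strides and channel counts reported in the paper; the names `DeepVOCNN` and `conv` are hypothetical.

```python
import torch.nn as nn

def conv(in_ch, out_ch, kernel, stride, relu=True):
    """A conv layer, optionally followed by ReLU (every layer except Conv6)."""
    layers = [nn.Conv2d(in_ch, out_ch, kernel, stride, padding=kernel // 2)]
    if relu:
        layers.append(nn.ReLU(inplace=True))
    return nn.Sequential(*layers)

class DeepVOCNN(nn.Module):
    """9-layer CNN over two stacked RGB frames (6 input channels)."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            conv(6,   64,  7, 2),               # Conv1
            conv(64,  128, 5, 2),               # Conv2
            conv(128, 256, 5, 2),               # Conv3
            conv(256, 256, 3, 1),               # Conv3_1
            conv(256, 512, 3, 2),               # Conv4
            conv(512, 512, 3, 1),               # Conv4_1
            conv(512, 512, 3, 2),               # Conv5
            conv(512, 512, 3, 1),               # Conv5_1
            conv(512, 1024, 3, 2, relu=False),  # Conv6: the one layer with no ReLU
        )

    def forward(self, stacked_pair):
        # stacked_pair: (B, 6, H, W), two consecutive RGB frames stacked channel-wise
        return self.features(stacked_pair)
```

Counting the 9 convolutions plus the 8 ReLUs gives the 17 layers mentioned above.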
b、RNN based Sequential Modelling
RNN is different from CNN in that it maintains memory of its hidden states over time and has feedback loops among them, which enables its current hidden state to be a function of the previous ones.
Given a convolutional feature $x_k$ at time $k$, an RNN updates at time step $k$ by

$$h_k = \mathcal{H}(W_{xh} x_k + W_{hh} h_{k-1} + b_h)$$
$$y_k = W_{hy} h_k + b_y$$

$h_k$ and $y_k$ are the hidden state and output at time $k$ respectively.
$W$ terms denote corresponding weight matrices.
$b$ terms denote bias vectors.
$\mathcal{H}$ is an element-wise nonlinear activation function.
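As a sketch, the update above is a couple of lines of PyTorch; `rnn_step` and the weight names are hypothetical, the tensors follow the row-vector convention, and `tanh` stands in for the generic activation $\mathcal{H}$.

```python
import torch

def rnn_step(x_k, h_prev, W_xh, W_hh, W_hy, b_h, b_y):
    """One vanilla RNN step, using tanh as the activation H."""
    h_k = torch.tanh(x_k @ W_xh + h_prev @ W_hh + b_h)  # h_k = H(W_xh x_k + W_hh h_{k-1} + b_h)
    y_k = h_k @ W_hy + b_y                              # y_k = W_hy h_k + b_y
    return h_k, y_k
```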
LSTM
Figure: folded and unfolded LSTMs and the internal structure of an LSTM unit.

Given the input $x_k$ at time $k$ and the hidden state $h_{k-1}$ and memory cell $c_{k-1}$ of the previous LSTM unit, the LSTM updates at time step $k$ as:

$$i_k = \sigma(W_{xi} x_k + W_{hi} h_{k-1} + b_i)$$
$$f_k = \sigma(W_{xf} x_k + W_{hf} h_{k-1} + b_f)$$
$$g_k = \tanh(W_{xg} x_k + W_{hg} h_{k-1} + b_g)$$
$$c_k = f_k \odot c_{k-1} + i_k \odot g_k$$
$$o_k = \sigma(W_{xo} x_k + W_{ho} h_{k-1} + b_o)$$
$$h_k = o_k \odot \tanh(c_k)$$

$\odot$ is the element-wise product of two vectors.
$\sigma$ is the sigmoid non-linearity.
$\tanh$ is the hyperbolic tangent non-linearity.
$W$ terms denote corresponding weight matrices.
$b$ terms denote bias vectors.
$i_k$, $f_k$, $g_k$, $c_k$ and $o_k$ are the input gate, forget gate, input modulation gate, memory cell and output gate respectively.
Two LSTM layers are stacked, and each of the LSTM layers has 1000 hidden states.
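A minimal sketch of this recurrent part using PyTorch's built-in `nn.LSTM`; `feature_dim` is a placeholder for the flattened size of the CNN output, and the final linear layer regressing a 6-DoF pose per time step is an assumption about how the hidden states are mapped to poses.

```python
import torch
import torch.nn as nn

feature_dim = 2048  # placeholder: stands in for the flattened Conv6 feature size
rnn = nn.LSTM(input_size=feature_dim, hidden_size=1000, num_layers=2, batch_first=True)
fc = nn.Linear(1000, 6)  # 6-DoF pose: 3D translation + 3 Euler angles (an assumption)

seq = torch.randn(4, 10, feature_dim)  # (batch, time steps, CNN features)
hidden_states, _ = rnn(seq)            # (4, 10, 1000): one hidden state per time step
poses = fc(hidden_states)              # (4, 10, 6): one pose estimate per time step
```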
3、Loss Function and Optimisation
The RCNN models the conditional probability of the poses $Y_t = (y_1, \ldots, y_t)$ given a sequence of monocular RGB images $X_t = (x_1, \ldots, x_t)$ up to time $t$:

$$p(Y_t \mid X_t) = p(y_1, \ldots, y_t \mid x_1, \ldots, x_t)$$

The optimal parameters are found by maximising this probability:

$$\theta^* = \operatorname*{argmax}_{\theta} \, p(Y_t \mid X_t; \theta)$$

In practice, the hyperparameters $\theta$ of the DNNs are learnt by minimising the MSE of all positions and orientations:

$$\theta^* = \operatorname*{argmin}_{\theta} \frac{1}{N} \sum_{i=1}^{N} \sum_{k=1}^{t} \left( \lVert \hat{p}_k - p_k \rVert_2^2 + \kappa \lVert \hat{\varphi}_k - \varphi_k \rVert_2^2 \right)$$
$(p_k, \varphi_k)$ is the ground-truth pose.
$(\hat{p}_k, \hat{\varphi}_k)$ is the estimated pose.
$\kappa$ (100 in the experiments) is a scale factor to balance the weights of positions and orientations.
$N$ is the number of samples.
The orientation $\varphi$ is represented by Euler angles rather than a quaternion, since a quaternion is subject to an extra unit constraint which hinders the optimisation problem of DL.
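A minimal sketch of this loss in PyTorch, assuming batched tensors of shape (N, t, 3) for positions and Euler-angle orientations; `deepvo_loss` is a hypothetical name.

```python
import torch

def deepvo_loss(p_hat, p_gt, phi_hat, phi_gt, kappa=100.0):
    """MSE over positions and orientations, with kappa balancing the two terms."""
    pos_err = torch.sum((p_hat - p_gt) ** 2, dim=-1)      # ||p̂_k − p_k||²
    ori_err = torch.sum((phi_hat - phi_gt) ** 2, dim=-1)  # ||φ̂_k − φ_k||²
    return torch.mean(pos_err + kappa * ori_err)          # mean over samples and time steps
```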
Reference: S. Wang, R. Clark, H. Wen and N. Trigoni, "DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent Convolutional Neural Networks," ICRA 2017.