Speech enabled Integrated AR-based Multimodal Language Translation

被引:0
|
作者
Bhargava, Mahesh [1 ]
Dhote, Pavan [1 ]
Srivastava, Amit [1 ]
Kumar, Ajai [1 ]
机构
[1] Ctr Dev Adv Comp C DAC, AAI, Pune, Maharashtra, India
关键词
Augmented Reality; visual data translation Optical Character Recognition (OCR) and speech synthesis;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With rapidly increasing smartphone user base in the country, many opportunities revolving the same can be explored. India being a multilingual nation, domestic/international tourist needs several language solutions on their fingertips. Applications such as instant language translation, maps, travel assistance, routes, restaurants, banks etc. have become a necessity on smartphones in the current scenario. We present a speech enabled augmented reality (AR) smartphones application for Hindi to English translation and vice-versa. With application interface user can move their smartphone over the word, phrase, billboard, signboard, hoarding, milestone etc. to get translated text. Using augmented reality technology the translated text seamlessly overlay [1] to the original text in real time video stream. This is achieved by critical mix of underlying technologies such as augmented reality, Optical Character Recognition (OCR), translation of visual data and speech synthesis. User will be able to effectively engage, communicate and access information originally available in native language.
引用
收藏
页码:226 / 230
页数:5
相关论文
共 50 条
  • [1] Multimodal Unsupervised Speech Translation for Recognizing and Evaluating Second Language Speech
    Lee, Yun Kyung
    Park, Jeon Gue
    APPLIED SCIENCES-BASEL, 2021, 11 (06):
  • [2] An AR-Based VLAN Visualizer
    Nagatomo, Yutaka
    Nishino, Hiroaki
    Kagawa, Tsuneo
    2014 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN (ICCE-TW), 2014,
  • [3] Feasibility of an AR-based Memory Training
    Lorentz, Lukas
    Mueller, Kristina
    Lendt, Michael
    Studer, Bettina
    BRAIN INJURY, 2022, 36 : 39 - 39
  • [4] Intelligent AR-based assistance systems
    Quandt, Moritz
    Freitag, Michael
    Stern, Hendrik
    WT Werkstattstechnik, 2024, 114 (06): : 325 - 333
  • [5] Evaluating the translation of speech to virtually-performed sign language on AR glasses
    Lan Thao Nguyen
    Schicktanz, Florian
    Stankowski, Aeneas
    Avramidis, Eleftherios
    2021 13TH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2021, : 141 - 144
  • [6] AR-Chat: an AR-based instant messaging system
    Jouet, Pierrick
    Alleaume, Vincent
    Laurent, Anthony
    Fradet, Matthieu
    Luo, Tao
    Baillard, Caroline
    ADJUNCT PROCEEDINGS OF THE 2020 IEEE INTERNATIONAL SYMPOSIUM ON MIXED AND AUGMENTED REALITY (ISMAR-ADJUNCT 2020), 2020, : 153 - 157
  • [7] Scenarios Exploration: How AR-Based Speech Balloons Enhance Car-to-Pedestrian Interaction
    Gui, Xinyue
    Chan, Chia-Ming
    Seo, Stela H.
    Toda, Koki
    Igarashi, Takeo
    HCI INTERNATIONAL 2024 POSTERS, PT V, HCII 2024, 2024, 2118 : 223 - 230
  • [8] Enhancing immersiveness in AR-based product design
    Ha, Taejin
    Kim, Yeongmi
    Ryu, Jeha
    Woo, Woontack
    ADVANCES IN ARTIFICIAL REALITY AND TELE-EXISTENCE, PROCEEDINGS, 2006, 4282 : 207 - +
  • [9] Building an AR-based smart campus platform
    Shian-Shyong Tseng
    Shih-Nung Chen
    Tsung-Yu Yang
    Multimedia Tools and Applications, 2022, 81 : 5695 - 5716
  • [10] AR-based Navigation Using Hybrid Map
    Gu, Yanlei
    Chidsin, Woranipit
    Goncharenko, Igor
    2021 IEEE 3RD GLOBAL CONFERENCE ON LIFE SCIENCES AND TECHNOLOGIES (IEEE LIFETECH 2021), 2021, : 266 - 267