Speech enabled Integrated AR-based Multimodal Language Translation

Authors
Bhargava, Mahesh [1]
Dhote, Pavan [1]
Srivastava, Amit [1]
Kumar, Ajai [1]
Affiliations
[1] Ctr Dev Adv Comp C DAC, AAI, Pune, Maharashtra, India
Keywords
Augmented Reality; visual data translation; Optical Character Recognition (OCR); speech synthesis
DOI
Not available
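Chinese Library Classification (CLC) codes
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809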
Abstract
With the smartphone user base in the country growing rapidly, many related opportunities can be explored. India is a multilingual nation, and domestic and international tourists need language solutions at their fingertips. Applications for instant language translation, maps, travel assistance, routes, restaurants, banks, and the like have become a necessity on smartphones. We present a speech-enabled augmented reality (AR) smartphone application for Hindi-to-English translation and vice versa. Through the application interface, users can move their smartphone over a word, phrase, billboard, signboard, hoarding, milestone, etc. to get the translated text. Using augmented reality technology, the translated text is seamlessly overlaid [1] on the original text in the real-time video stream. This is achieved by combining underlying technologies such as augmented reality, Optical Character Recognition (OCR), translation of visual data, and speech synthesis. Users will be able to effectively engage, communicate, and access information originally available in the native language.
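The stages named in the abstract (text recognition, translation, overlay, speech output) can be pictured as a per-frame pipeline. The following is only a minimal sketch under stated assumptions, not the authors' implementation: `TextRegion`, `Overlay`, `TOY_LEXICON`, `process_frame`, and `fake_ocr` are hypothetical names, the OCR and speech stages are stubbed out as plain callables, and a two-word dictionary stands in for a real Hindi-English translation engine.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

# Bounding box as (x, y, width, height) in frame pixel coordinates.
Box = Tuple[int, int, int, int]


@dataclass
class TextRegion:
    """Text found in a camera frame, with its location (OCR stage output)."""
    text: str
    box: Box


@dataclass
class Overlay:
    """Translated text to be drawn over the original text region."""
    original: str
    translated: str
    box: Box


# Hypothetical toy lexicon standing in for a real translation engine.
TOY_LEXICON = {"निकास": "Exit", "प्रवेश": "Entrance"}


def translate(text: str) -> str:
    """Look up a phrase in the toy lexicon; unknown text is returned unchanged."""
    return TOY_LEXICON.get(text, text)


def process_frame(
    frame: object,
    detect_text: Callable[[object], List[TextRegion]],
    speak: Optional[Callable[[str], None]] = None,
) -> List[Overlay]:
    """Run one frame through OCR, translation, and optional speech output.

    Returns the overlays that a renderer would draw at each region's box.
    """
    overlays = []
    for region in detect_text(frame):
        translated = translate(region.text)
        overlays.append(Overlay(region.text, translated, region.box))
        if speak is not None:
            speak(translated)  # speech synthesis stage, stubbed by the caller
    return overlays


def fake_ocr(frame: object) -> List[TextRegion]:
    """Stand-in OCR stage that always reports one signboard word."""
    return [TextRegion("निकास", (40, 60, 120, 32))]


spoken: List[str] = []
overlays = process_frame(frame=None, detect_text=fake_ocr, speak=spoken.append)
```

Passing the OCR and speech stages in as callables keeps the sketch independent of any particular library; in a real application they would be backed by an OCR engine and a text-to-speech service, and the returned boxes would drive the AR renderer.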
Pages: 226-230
Page count: 5