{"id":70867,"date":"2024-05-15T09:00:06","date_gmt":"2024-05-15T13:00:06","guid":{"rendered":"https:\/\/news.samsung.com\/us\/?p=70867"},"modified":"2024-06-13T14:53:33","modified_gmt":"2024-06-13T18:53:33","slug":"the-learning-curve-part-1-why-teaching-ai-new-languages-begins-with-data","status":"publish","type":"post","link":"https:\/\/news.samsung.com\/us\/samsung-teaching-ai-new-languages-begins-with-data-learning-curve-part-1\/","title":{"rendered":"The Learning Curve, Part 1: Why Teaching AI New Languages Begins with Data"},"content":{"rendered":"<p>As Samsung continues to pioneer premium mobile AI experiences, we <a href=\"https:\/\/news.samsung.com\/us\/tag\/the-learning-curve\/\" target=\"_blank\" rel=\"noopener\">visit Samsung Research centers around the world<\/a> to learn how <a href=\"https:\/\/www.samsung.com\/us\/galaxy-ai\/\" target=\"_blank\" rel=\"noopener\">Galaxy AI<\/a><a href=\"#_ftn1\" name=\"_ftnref1\"><sup>1<\/sup><\/a> is enabling more users to maximize their potential. <a href=\"https:\/\/news.samsung.com\/us\/samsung-galaxy-ai-now-supports-more-languages-latest-update\/\" target=\"_blank\" rel=\"noopener\">Galaxy AI now supports 16 languages<\/a>, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as Live Translate<a href=\"#_ftn2\" name=\"_ftnref2\"><sup>2<\/sup><\/a>, Interpreter, Note Assist and Browsing Assist. But what does AI language development involve? This series examines the challenges of working with mobile AI and how we overcame them. First up, we head to Indonesia to learn where one begins teaching AI to speak a new language.<\/p>\n<p><a href=\"https:\/\/www.samsung.com\/us\/galaxy-ai\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-70871\" src=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154155\/samsung-learning-curve-part-1-headshot.png\" alt=\"\" width=\"1000\" height=\"600\" srcset=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154155\/samsung-learning-curve-part-1-headshot.png 1000w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154155\/samsung-learning-curve-part-1-headshot-600x360.png 600w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154155\/samsung-learning-curve-part-1-headshot-950x570.png 950w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154155\/samsung-learning-curve-part-1-headshot-664x398.png 664w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154155\/samsung-learning-curve-part-1-headshot-437x262.png 437w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><\/a><\/p>\n<p>The first step is establishing targets, according to the team at Samsung R&amp;D Institute Indonesia (SRIN). \u201cGreat AI begins good quality and relevant data. Each language demands a different way to process this, so we dive deep to understand the linguistic needs and the unique conditions of our country,\u201d says Junaidillah Fadlil, head of AI at SRIN, whose team recently added Bahasa Indonesia (Indonesian language) support to <a href=\"https:\/\/www.samsung.com\/us\/galaxy-ai\/\" target=\"_blank\" rel=\"noopener\">Galaxy AI<\/a>. \u201cLocal language development has to be led by insight and science, so every process for adding languages to <a href=\"https:\/\/www.samsung.com\/us\/galaxy-ai\/\" target=\"_blank\" rel=\"noopener\">Galaxy AI<\/a> starts with us planning what information we need and can legally and ethically obtain.\u201d<\/p>\n<p><a href=\"https:\/\/www.samsung.com\/us\/galaxy-ai\/\" target=\"_blank\" rel=\"noopener\">Galaxy AI<\/a> features such as Live Translate perform three core processes: automatic speech recognition (ASR), machine translation (MT) and text-to-speech (TTS). Each process needs a distinct set of information.<\/p>\n<p><a href=\"https:\/\/www.samsung.com\/us\/galaxy-ai\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-70870\" src=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154148\/samsung-learning-curve-part-1-galay-ai.png\" alt=\"\" width=\"1000\" height=\"600\" srcset=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154148\/samsung-learning-curve-part-1-galay-ai.png 1000w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154148\/samsung-learning-curve-part-1-galay-ai-600x360.png 600w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154148\/samsung-learning-curve-part-1-galay-ai-950x570.png 950w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154148\/samsung-learning-curve-part-1-galay-ai-664x398.png 664w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154148\/samsung-learning-curve-part-1-galay-ai-437x262.png 437w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><\/a><\/p>\n<p>ASR, for instance, needs extensive recordings of speech in numerous environments, each paired with an accurate text transcription. Varying background noise levels help account for different environments. \u201cIt\u2019s not enough just to add traffic noises to recordings,\u201d explains Muchlisin Adi Saputra, the team\u2019s ASR lead. \u201cWe must go out into traffic or to a mall where we can authentically capture unique sounds at street level, like people calling out or hammering from a construction site.\u201d<\/p>\n<p><a href=\"https:\/\/www.samsung.com\/us\/galaxy-ai\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-70872\" src=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154202\/samsung-learning-curve-part-1-indonesia.png\" alt=\"\" width=\"1000\" height=\"600\" srcset=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154202\/samsung-learning-curve-part-1-indonesia.png 1000w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154202\/samsung-learning-curve-part-1-indonesia-600x360.png 600w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154202\/samsung-learning-curve-part-1-indonesia-950x570.png 950w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154202\/samsung-learning-curve-part-1-indonesia-664x398.png 664w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154202\/samsung-learning-curve-part-1-indonesia-437x262.png 437w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><\/a><\/p>\n<p>Sources of data must also be considered. Saputra adds: \u201cWe need to keep up to date with the latest slang and how it is used, and mostly we find it on social media!\u201d<\/p>\n<div class=\"embedded product-module\">\n\t\t<a id=\"product-module-0\" href=\"https:\/\/www.samsung.com\/us\/smartphones\/galaxy-s26-ultra\/buy\/\" title=\"Galaxy S26 Series\" class=\"snr-article_product-image\" target=\"_blank\">\n\t\t<img decoding=\"async\" src=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/01\/01202144\/2026-Product-Banner-Refresh-1.png\" alt=\"Galaxy S26 Series\">\n\t\t<\/a>\n\t<\/div>\n<p>Next, MT requires translation training data. \u201cTranslating Bahasa Indonesia is challenging,\u201d says Muhamad Faisal, the team\u2019s MT lead. &#8220;Its extensive use of contextual and implicit meanings relies on social and situational cues, so we need numerous translated texts that the AI could reference for new words, foreign words, proper nouns, and idioms \u2013 any information that helps AI understand the context and rules of communication.\u201d<\/p>\n<p><a href=\"https:\/\/www.samsung.com\/us\/galaxy-ai\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-70874\" src=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154214\/samsung-learning-curve-part-1-r-d-institute.png\" alt=\"\" width=\"1000\" height=\"600\" srcset=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154214\/samsung-learning-curve-part-1-r-d-institute.png 1000w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154214\/samsung-learning-curve-part-1-r-d-institute-600x360.png 600w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154214\/samsung-learning-curve-part-1-r-d-institute-950x570.png 950w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154214\/samsung-learning-curve-part-1-r-d-institute-664x398.png 664w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154214\/samsung-learning-curve-part-1-r-d-institute-437x262.png 437w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><\/a><\/p>\n<p>TTS then requires recordings that cover a range of voices and tones, with additional context on how parts of words sound in different circumstances. \u201cGood voice recordings could do half the job and cover all the required phonemes (units of sound in speech) for the AI model,\u201d adds Harits Abdurrohman, TTS lead. \u201cIf a voice actor did a great job in the earlier phase, the focus shifts to refining the AI model to clearly pronounce specific words.\u201d<\/p>\n\n\t\t<\/div>\n\t\t<\/div>\n\t\t<div class=\"embedded recommended recommended-post\">\n\t\t\t<div class=\"embedded-inner\">\n\t\t\t\t<div class=\"card-badge\">\n\t\t\t\t\t<p>Recommended News<\/p>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"recommended-card\">\n\t\t\t\t\t<a href=\"https:\/\/news.samsung.com\/us\/one-ui-6-1-update-brings-galaxy-ai-features-to-galaxy-s22-series-and-more\/\" class=\"recommended-news\">\n\t\t\t\t\t\t<img decoding=\"async\" src=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/02151313\/samsung-galaxy-ai-transcript-assist-268x178.jpg\" alt=\"One UI 6.1 Update Brings Galaxy AI Features to Galaxy S22 Series and More\">\n\t\t\t\t\t\t<div class=\"details\">\n\t\t\t\t\t\t\t<div class=\"details-inner\">\n\t\t\t\t\t\t\t\t<p class=\"post-category\">Mobile<\/p>\n\t\t\t\t\t\t\t\t<h4>One UI 6.1 Update Brings Galaxy AI Features to Galaxy S22 Series and More<\/h4>\n\t\t\t\t\t\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/a>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t<\/div>\n\t\t<div class=\"article-content\">\n\t\t<div class=\"article-body\">\n<h5><strong>Stronger Together<\/strong><\/h5>\n<p>It takes vast resources to plan for much data, and SRIN worked closely with linguistics experts. \u201cThis challenge requires creativity, resourcefulness and expertise in both Bahasa Indonesia and machine learning,\u201d Fadlil reflects. \u201cSamsung\u2019s philosophy of open collaboration played a big part in getting the job done, as did our scale of operations and history of AI development.\u201d<\/p>\n<p>Working with other Samsung Research centers around the world, the SRIN team was able to quickly adopt best practices and overcome the complexities of establishing data targets. Furthermore, collaboration was good for advancing not only technology but also culture. When the SRIN team joined their counterparts in Bangalore, India, they observed the local fasting customs, creating deeper connections and expanding their cultural understanding.<\/p>\n<p><a href=\"https:\/\/www.samsung.com\/us\/galaxy-ai\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-70875\" src=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154223\/samsung-learning-curve-part-1-research-centers.png\" alt=\"\" width=\"1000\" height=\"600\" srcset=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154223\/samsung-learning-curve-part-1-research-centers.png 1000w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154223\/samsung-learning-curve-part-1-research-centers-600x360.png 600w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154223\/samsung-learning-curve-part-1-research-centers-950x570.png 950w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154223\/samsung-learning-curve-part-1-research-centers-664x398.png 664w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154223\/samsung-learning-curve-part-1-research-centers-437x262.png 437w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><\/a><\/p>\n<p>For the team, <a href=\"https:\/\/www.samsung.com\/us\/galaxy-ai\/\" target=\"_blank\" rel=\"noopener\">Galaxy AI\u2019s language expansion<\/a> project took on a new significance. \u201cWe are particularly proud of our achievements here as this was our first AI project, and it won\u2019t be our last as we continue to refine our models and improve the quality of output,\u201d Fadlil concludes. \u201cThis expansion not only reflects our values but also respects and incorporates our cultural identities through language.\u201d<\/p>\n<p><a href=\"https:\/\/www.samsung.com\/us\/galaxy-ai\/\" target=\"_blank\" rel=\"noopener\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-70869\" src=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154142\/samsung-learning-curve-mobile-part-1.png\" alt=\"\" width=\"1000\" height=\"600\" srcset=\"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154142\/samsung-learning-curve-mobile-part-1.png 1000w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154142\/samsung-learning-curve-mobile-part-1-600x360.png 600w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154142\/samsung-learning-curve-mobile-part-1-950x570.png 950w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154142\/samsung-learning-curve-mobile-part-1-664x398.png 664w, https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154142\/samsung-learning-curve-mobile-part-1-437x262.png 437w\" sizes=\"auto, (max-width: 1000px) 100vw, 1000px\" \/><\/a><\/p>\n<p>In the next episode of this series, we head to Jordan who led Arabic language project to find out how to build an AI for diverse dialects and the complexity behind.<\/p>\n<h6><a href=\"#_ftnref1\" name=\"_ftn1\"><sup>1<\/sup><\/a> \u00a0\u00a0Galaxy AI features by Samsung will be provided for free until the end of 2025 on supported Samsung Galaxy devices.<\/h6>\n<h6><a href=\"#_ftnref2\" name=\"_ftn2\"><sup>2<\/sup><\/a> \u00a0\u00a0Samsung account log-in required. Calls must be made using the native Samsung phone app. Samsung does not make any promises, assurances or guarantees as to the accuracy, completeness or reliability of the output provided by AI features.<\/h6>\n","protected":false},"excerpt":{"rendered":"<p>Samsung Research in Indonesia be part of a series about the people and innovations behind the democratization of mobile AI<\/p>\n","protected":false},"author":84,"featured_media":70873,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[29720,29718,29721],"tags":[953,25940,30339,40,30424,30425],"blue-badge":[],"class_list":["post-70867","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-product-mobile","category-product","category-product-mobile-smartphones","tag-artificial-intelligence-ai","tag-galaxy","tag-galaxy-ai","tag-mobile","tag-samsung-rd","tag-the-learning-curve"],"acf":{"turn_off_retargeting":false},"fimg_mobile_url":"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154142\/samsung-learning-curve-mobile-part-1-200x200.png","fimg_url":"https:\/\/img.us.news.samsung.com\/us\/wp-content\/uploads\/2024\/05\/14154142\/samsung-learning-curve-mobile-part-1-432x286.png","primary_category":{"term_id":29721,"name":"Smartphones","slug":"product-mobile-smartphones","term_group":0,"term_taxonomy_id":29721,"taxonomy":"category","description":"","parent":29720,"count":401,"filter":"raw","term_link":"https:\/\/news.samsung.com\/us\/category\/product\/product-mobile\/product-mobile-smartphones\/","term_path":"product\/product-mobile\/product-mobile-smartphones"},"badge":false,"_links":{"self":[{"href":"https:\/\/news.samsung.com\/us\/wp-json\/wp\/v2\/posts\/70867","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/news.samsung.com\/us\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/news.samsung.com\/us\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/news.samsung.com\/us\/wp-json\/wp\/v2\/users\/84"}],"replies":[{"embeddable":true,"href":"https:\/\/news.samsung.com\/us\/wp-json\/wp\/v2\/comments?post=70867"}],"version-history":[{"count":0,"href":"https:\/\/news.samsung.com\/us\/wp-json\/wp\/v2\/posts\/70867\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/news.samsung.com\/us\/wp-json\/wp\/v2\/media\/70873"}],"wp:attachment":[{"href":"https:\/\/news.samsung.com\/us\/wp-json\/wp\/v2\/media?parent=70867"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/news.samsung.com\/us\/wp-json\/wp\/v2\/categories?post=70867"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/news.samsung.com\/us\/wp-json\/wp\/v2\/tags?post=70867"},{"taxonomy":"blue-badge","embeddable":true,"href":"https:\/\/news.samsung.com\/us\/wp-json\/wp\/v2\/blue-badge?post=70867"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}