<?xml version="1.0" encoding="UTF-8"?><?xml-stylesheet title="XSL_formatting" type="text/xsl" href="https://news.samsung.com/global/wp-content/plugins/btr_rss/btr_rss.xsl"?><rss version="2.0"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:wfw="http://wellformedweb.org/CommentAPI/"
     xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:atom="http://www.w3.org/2005/Atom"
     xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
     xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	 xmlns:media="http://search.yahoo.com/mrss/"
	>
	<channel>
		<title>The Learning Curve &#8211; Samsung Global Newsroom</title>
		<atom:link href="https://news.samsung.com/global/tag/the-learning-curve/feed" rel="self" type="application/rss+xml" />
		<link>https://news.samsung.com/global</link>
        <image>
            <url>https://img.global.news.samsung.com/image/newlogo/logo_samsung-newsroom.png</url>
            <title>The Learning Curve &#8211; Samsung Global Newsroom</title>
            <link>https://news.samsung.com/global</link>
        </image>
        <currentYear>2024</currentYear>
        <cssFile>https://news.samsung.com/global/wp-content/plugins/btr_rss/btr_rss_xsl.css</cssFile>
		<description>What's New on Samsung Newsroom</description>
		<lastBuildDate>Thu, 02 Apr 2026 18:21:43 +0000</lastBuildDate>
		<language>en-US</language>
		<sy:updatePeriod>hourly</sy:updatePeriod>
		<sy:updateFrequency>1</sy:updateFrequency>
					<item>
				<title>[Recap] The Learning Curve: How Samsung’s R&D Institutes Around the World Worked on Galaxy AI</title>
				<link>https://news.samsung.com/global/recap-the-learning-curve-how-samsungs-rd-institutes-around-the-world-worked-on-galaxy-ai</link>
				<pubDate>Tue, 30 Jul 2024 14:00:48 +0000</pubDate>
								<media:content url="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institutes-Around-the-World_Thumbnail728.jpg" medium="image" />
				<dc:creator><![CDATA[Samsung Newsroom]]></dc:creator>
						<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Galaxy AI]]></category>
		<category><![CDATA[Interpreter]]></category>
		<category><![CDATA[Live Translate]]></category>
		<category><![CDATA[Samsung R&D Institute]]></category>
		<category><![CDATA[Text-to-speech]]></category>
		<category><![CDATA[The Learning Curve]]></category>
                <guid isPermaLink="false">https://bit.ly/46q0JBj</guid>
									<description><![CDATA[Galaxy AI has already helped millions of users around the world connect and communicate. On-device AI features based on large language models (LLMs) — such as Live Translate, Interpreter, Note Assist and Browsing Assist — supports 16 languages, with four more coming by the end of the year. The process of building language features for […]]]></description>
																<content:encoded><![CDATA[<p>Galaxy AI has already helped <a href="https://news.samsung.com/global/galaxy-unpacked-2024-the-future-of-mobile-ai-expert-panel-highlights-collaborative-responsible-ai-innovation#:~:text=Galaxy%20AI%20has%20already%20been%20used%20on%20more%20than%20100%20million%20devices" target="_blank" rel="noopener">millions</a> of users around the world connect and communicate. On-device AI features based on large language models (LLMs) — such as Live Translate, Interpreter, Note Assist and Browsing Assist — <a href="https://bit.ly/3VRKNEZ" target="_blank" rel="noopener">supports 16 languages</a>, with four more coming by the end of the year.</p>
<p>The process of building language features for Galaxy AI involved much time and effort as each language presents a unique structure and culture. Samsung’s Researchers from around the world — in Brazil, China, India, Indonesia, Japan, Jordan, Poland and Vietnam — shared the challenges and triumphs behind the development of Galaxy AI. Samsung Newsroom compiled a recap of their stories below.</p>
<h3><span style="color: #000080"><strong>Developing a Translation Model</strong></span></h3>
<p>Galaxy AI features such as Live Translate perform three core processes: automatic speech recognition (ASR), neural machine translation (NMT) and text-to-speech (TTS).</p>
<div id="attachment_154357" style="width: 1010px" class="wp-caption alignnone"><img aria-describedby="caption-attachment-154357" class="size-full wp-image-154357" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institutes-Around-the-World_main1.jpg" alt="" width="1000" height="667" /><p id="caption-attachment-154357" class="wp-caption-text">▲ Automatic speech recognition (ASR), neural machine translation (NMT) and text-to-speech (TTS) each require distinct sets of information for training</p></div>
<p><a href="https://bit.ly/3yxPn1t" target="_blank" rel="noopener">Samsung R&D Institute Vietnam</a> (SRV) faced obstacles with automatic speech recognition (ASR) models because Vietnamese is a language with six distinct tones. Tonal languages can be difficult for AI to recognize because of the complexity tones add to linguistic nuances. SRV responded to the challenge with a model that differentiates between shorter audio frames of around 20 milliseconds.</p>
<p><a href="https://bit.ly/3W89pJa" target="_blank" rel="noopener">Samsung R&D Institute Poland</a> (SRPOL) had the mammoth hurdle of training neural machine translation (NMT) models for a continent as diverse as Europe. Leveraging its rich pool of experience in projects spanning more than 30 languages across four time zones, SRPOL was able to navigate the untranslatability of certain phrases and handle idiomatic expressions that may not have direct equivalents in other languages.</p>
<p><a href="https://bit.ly/3V4yRyR" target="_blank" rel="noopener">Samsung R&D Institute Jordan</a> (SRJO) adapted Arabic — a language spoken across more than 20 countries in about 30 dialects — for Galaxy AI. Creating a text-to-speech (TTS) model was no small endeavor since diacritics and guides for pronunciation are widely understood by native Arabic speakers but absent in writing. Based on a sophisticated prediction model for missing diacritics, SRJO was able to publish a language model that understands dialects and can answer in standard Arabic.</p>
<h3><span style="color: #000080"><strong>The Importance of Data</strong></span></h3>
<p>Throughout the process of training Galaxy AI in each language, an overarching theme was the importance of open collaboration with local institutions. The quality of data used directly affects the accuracy of ASR, NMT and TTS. So Samsung worked with various partners to obtain and review data that reflected each region’s jargon, dialects and other variations.</p>
<div id="attachment_154358" style="width: 1010px" class="wp-caption alignnone"><img aria-describedby="caption-attachment-154358" class="size-full wp-image-154358" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institutes-Around-the-World_main2.jpg" alt="" width="1000" height="571" /><p id="caption-attachment-154358" class="wp-caption-text">▲ Each language has a distinct set of qualities that pose challenges in creating an AI language model for it. Tones add to the complexity for tonal languages such as Vietnamese.</p></div>
<p><a href="https://bit.ly/3znJWmb" target="_blank" rel="noopener">Samsung R&D Institute India-Bangalore</a> (SRI-B) collaborated with the Vellore Institute of Technology to secure almost a million lines of segmented and curated audio data on conversational speech, words and commands. The students got hands-on experience on a real-life project as well as mentorship from Samsung experts; the rich store of data helped SRI-B train Galaxy AI in Hindi, covering more than 20 regional dialects and their respective tonal inflections, punctuation and colloquialisms</p>
<p>Local linguistic insights were imperative for the Latin American Spanish model because the diversity within the language is mirrored by the diversity of its user base. For example, the word for swimming pool could be <em>alberca </em>(Mexico), <em>piscina </em>(Colombia, Bolivia, Venezuela) or <em>pileta</em> (Argentina, Paraguay, Uruguay) based on which region you’re from. <a href="https://bit.ly/45jblBF" target="_blank" rel="noopener">Samsung R&D Institute Brazil</a> (SRBR) worked with science and technology institutes SiDi and Sidia to collect and manage massive amounts of data as well as refine and improve upon audio and text sources for Galaxy AI’s Latin American Spanish model.</p>
<p><a href="https://bit.ly/4c1YkhU" target="_blank" rel="noopener">Samsung R&D Institute China</a>-Beijing (SRC-B) and Samsung R&D Institute China-Guangzhou (SRC-G) partnered with Chinese companies Baidu and Meitu to leverage their expertise from developing large language models (LLM) such as ERNIE Bot and MiracleVision, respectively. As a result, Galaxy AI supports both main modes of Mandarin Chinese and Cantonese.</p>
<p>In addition to external cooperation, due diligence and internal resources were also essential.</p>
<p>Bahasa Indonesia is a language notorious for its extensive use of contextual and implicit meanings that rely on social and situational cues. <a href="https://bit.ly/3QFV6sh" target="_blank" rel="noopener">Samsung R&D Institute Indonesia</a> (SRIN) researchers went out into the field to record conversations in coffee shops and working environments to capture authentic ambient noises that could distort input. This helped the model learn to recognize the necessary information from verbal input, ultimately improving the accuracy of speech recognition.</p>
<p>There are many homonyms in Japanese as the number of sounds is limited in the language. So many words must be determined based on the context. <a href="https://bit.ly/3Yiy5jR" target="_blank" rel="noopener">Samsung R&D Institute Japan</a> (SRJ) used Samsung Gauss, the company’s internal LLM, structure contextual sentences with words or phrases relevant to each scenario to help the AI model differentiate between homonyms.</p>
<h3><span style="color: #000080"><strong>Samsung’s Global Research Network</strong></span></h3>
<p>The professionals across various Samsung R&D Institutes made full use of Samsung’s global research network.</p>
<p>Before tackling Hindi, SRI-B collaborated with teams around the world to develop AI language models for British, Indian and Australian English as well as Thai, Vietnamese and Indonesian. Engineers from other Samsung research centers visited Bangalore, India, to bring Vietnamese, Thai and Indonesian to Galaxy AI.</p>
<div id="attachment_154359" style="width: 1010px" class="wp-caption alignnone"><img aria-describedby="caption-attachment-154359" class="size-full wp-image-154359" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institutes-Around-the-World_main3.jpg" alt="" width="1000" height="667" /><p id="caption-attachment-154359" class="wp-caption-text">▲ Staff and collaborators pose in front of Samsung R&D Institute India-Bangalore (SRI-B)</p></div>
<p>SRPOL had extensive experience developing ASR, NMT and TTS models for a multitude of languages. A key player in Galaxy AI’s language expansion, SRPOL collaborated across continents to support SRJO with Arabic dialects and SRBR with Brazilian Portuguese and Latin American Spanish.</p>
<p>Samsung developers at each of these locations learned to collaborate across borders and time zones. Developers from SRIN even observed the local fasting customs in India when meeting their SRI-B colleagues. Many reflected on their work with pride and gratitude — realizing the lasting implications this project has on language, culture, heritage and identity.</p>
<h3><span style="color: #000080"><strong>Ongoing Efforts as the Journey Continues</strong></span></h3>
<p>Samsung recently <a href="https://bit.ly/4bz6a21" target="_blank" rel="noopener">introduced</a> Galaxy AI to its latest foldables and wearables. Since its release earlier this year, Galaxy AI has already been used on more than 100 million devices. “We’re expecting to reach 200 million devices by the end of 2024,” said Won-joon Choi, EVP and Head of the Mobile R&D Office, Mobile eXperience Business at Samsung Electronics at a recent <a href="https://bit.ly/3zEed0a" target="_blank" rel="noopener">panel discussion</a>.</p>
<p>Amidst this mission to democratize AI, it is important to look back and celebrate the accomplishments and progress that have led to providing this safe and inclusive technology that will benefit humanity and improve lives. By building up the Galaxy AI ecosystem with even more features, languages and regional variations, Samsung is facilitating cross-cultural exchanges in unprecedented ways to realize its vision of AI for All.</p>
]]></content:encoded>
																				</item>
					<item>
				<title>The Learning Curve, Part 8: Creating Conversations From Japan to the World</title>
				<link>https://news.samsung.com/global/the-learning-curve-part-8-creating-conversations-from-japan-to-the-world</link>
				<pubDate>Fri, 12 Jul 2024 17:00:16 +0000</pubDate>
								<media:content url="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-Japan_thumb728.jpg" medium="image" />
				<dc:creator><![CDATA[Samsung Newsroom]]></dc:creator>
						<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Galaxy AI]]></category>
		<category><![CDATA[Interpreter]]></category>
		<category><![CDATA[Live Translate]]></category>
		<category><![CDATA[Samsung R&D Institute]]></category>
		<category><![CDATA[Samsung R&D Institute Japan]]></category>
		<category><![CDATA[Text-to-speech]]></category>
		<category><![CDATA[The Learning Curve]]></category>
                <guid isPermaLink="false">https://bit.ly/3Yiy5jR</guid>
									<description><![CDATA[As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline thanks to on-device translation in features such as […]]]></description>
																<content:encoded><![CDATA[<p>As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline thanks to on-device translation in features such as Live Translate, Interpreter, Note Assist and Browsing Assist. But what does AI language development involve? Last time, we visited <span><a href="https://news.samsung.com/global/the-learning-curve-7-poland-collaboration-and-communication-across-european-borders-and-cultures" target="_blank" rel="noopener">Poland</a></span> to discover how European countries collaborate to accomplish their goal. This time, we’re in Japan to see how developers are constantly adapting to new scenarios and use cases.</p>
<p>Samsung R&D Institute Japan (SRJ) was established as an R&D center focused on hardware such as home appliances and displays. With the demand for AI innovation ramping up globally, SRJ in Yokohama has also been operating a software development lab to create Galaxy AI’s Live Translate, which automatically translates voice calls in real time, since the end of last year.</p>
<p>“<span>Live Translate</span> is particularly efficient for travel scenarios such as visitors to this year’s Olympic Games in Paris,” says Takayuki Akasako, the Head of Artificial Intelligence at SRJ. “We are currently developing a speech recognition program for people who are both sightseeing and watching the Paris Olympic Games; by training the speech recognition program to learn about the games and locations of stadiums for Paris 2024.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-153830" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-Japan_main1.jpg" alt="" width="1000" height="667" /></p>
<h3><span style="color: #000080"><strong>Understanding Context in Voice Recognition</strong></span></h3>
<p>For those already using the translation features of Galaxy AI, such functionalities may seem very useful. But for developers who have made the features come to life, they know that being able to communicate while traveling abroad isn’t something that can be taken for granted.</p>
<p>One thing the team noted was that there are more homonyms in Japanese than some other languages. For instance, ‘chopsticks’ (Hashi,箸) and ‘bridge’ (Hashi,橋) are relatively easy to distinguish due to the difference in intonation, but words like ‘sightseeing’(Kankō,観光), ‘customs’(Kankō,慣行), ‘public’ (Kōkyō,公共) and ‘prosperity’ (Kōkyō,好況) must be judged based on the context.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-153839" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-Japan_main2_Final.jpg" alt="" width="1000" height="667" /></p>
<p>“Judgement becomes more difficult when the context is ambiguous, such as names of locale and people, proper nouns, dialects and numbers,” says Akasako. “So in order to improve the accuracy of speech recognition, a lot of data is needed.”</p>
<p>“We always look for ways to fine-tune the AI model for key events and moments in a timely manner,” continues Akasako. “With a lot of new combinations of place names and activities, it’s important that the context is still clear when people are using Galaxy AI.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-153837" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-Japan_main3_Final.jpg" alt="" width="1000" height="667" /></p>
<h3><span style="color: #000080"><strong>Challenges in Collecting Efficient Data</strong></span></h3>
<p>While recognizing the types of data needed is also important, collecting the data in and of itself is a challenge in its own right.</p>
<p>Previously, the SRJ team used human-recorded data to train the speech recognition engine for Live Translate, which didn’t result in sufficient data collection.</p>
<p>Samsung Gauss, the company’s Large Language Model (LLM), uses scripts to structure sentences with words or phrases that are relevant to each scenario. The data collected with Samsung Gauss is not only recorded by humans, but also generated by a speech synthesis text-to-speech (TTS) data, through which human resources do the final check on the quality. Using this method, the team has seen a dramatic improvement in data collection efficiency.</p>
<p>“Every time a problem is identified and solved, the accuracy of speech recognition improves significantly,” says Akasako. “Regardless of where people are, our goal is connecting people with each other, and the tools powered by Galaxy AI will ensure more fun and efficient communication.”</p>
]]></content:encoded>
																				</item>
					<item>
				<title>The Learning Curve 7 — Poland: Collaboration and Communication Across European Borders and Cultures</title>
				<link>https://news.samsung.com/global/the-learning-curve-7-poland-collaboration-and-communication-across-european-borders-and-cultures</link>
				<pubDate>Thu, 04 Jul 2024 17:00:58 +0000</pubDate>
								<media:content url="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-Poland_thumb728.jpg" medium="image" />
				<dc:creator><![CDATA[Samsung Newsroom]]></dc:creator>
						<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Galaxy AI]]></category>
		<category><![CDATA[Interpreter]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[Live Translate]]></category>
		<category><![CDATA[Samsung R&D Institute]]></category>
		<category><![CDATA[Samsung R&D Institute Poland]]></category>
		<category><![CDATA[Text-to-speech]]></category>
		<category><![CDATA[The Learning Curve]]></category>
                <guid isPermaLink="false">https://bit.ly/3W89pJa</guid>
									<description><![CDATA[As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as […]]]></description>
																<content:encoded><![CDATA[<p>As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as Live Translate, Interpreter, Note Assist and Browsing Assist. But what does AI language development involve? Last time, we visited <a href="https://news.samsung.com/global/the-learning-curve-part-6-the-collaborative-path-to-ai-innovation" target="_blank" rel="noopener">India</a> to learn how teams collaborate with students and universities to bring Galaxy AI to more people. This time, we’re in Poland to discover how European countries collaborate to accomplish their goal.</p>
<p>There’s a saying at the Samsung R&D Institute Poland (SRPOL): “<em>A day at SRPOL lasts 96 hours”</em>. It refers to the center’s global role as one of the largest and fastest-growing R&D centers in the region, often working across four different time zones. Sitting at the heart of Europe while covering many European and global markets, SRPOL has worked on automatic speech recognition, neural machine translation and text-to-speech models for more than 30 languages. When it came to bringing 10 languages to Galaxy AI, this expertise meant the team was well suited to seamlessly blend cultural perspectives with Samsung’s global technology.</p>
<p>SRPOL has years of experience in Natural Language Processing. What makes it unique is its adaptability to work on any language thanks to the passionate team and their tools, such as a crowdsourcing platform that enables fast and agile development.</p>
<p>“Collaboration across the continent means relentless data collection, annotation and research, which has become something we really enjoy,” says Kornel Jankowski, Head of Speech Decoding at SRPOL. “We’ve dealt with so many languages that our team developed universal, language-agnostic skills. When we’re asked to support a new language model, everybody’s attitude is: <em>Oh wow, we get to learn another one, that’s going to be fun!</em>”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-153358" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-Poland_main1.jpg" alt="" width="1000" height="625" /></p>
<h3><span style="color: #000080"><strong>A European Center for AI Language Development</strong></span></h3>
<p>Language is a cornerstone of culture and communication across Europe regardless of whether it’s incorporated into technology. However, it presents unique challenges for the team at SRPOL, who develop AI models for European languages.</p>
<p>“Each language and the culture that it is part of, comes with hurdles that make us reevaluate how we perceive a specific issue,” explains Adam Ros, Head of Artificial Intelligence at SRPOL. These hurdles include navigating the untranslatability of certain phrases and handling idiomatic expressions that may not have direct equivalents in other languages.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-153359" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-Poland_main2.jpg" alt="" width="1000" height="563" /></p>
<p>The team saw these challenges as an opportunity to make SRPOL a European center for AI language development. The biggest benefit of this is that it shortens the communication path between different departments and crucially, the decision-making path. Whether it is a matter of automatic speech recognition, neural machine translation or text-to-speech, teams could simply walk over to colleagues in Mobile Quality Assurance and efficiently solve problems together.</p>
<p>While this has helped, it hasn’t overcome all AI challenges. Inevitably, there are limitations in AI models when dealing with multiple European languages, such as translating without context or variations in intonation. However, the team saw these as an opportunity to keep learning and innovating.</p>
<p><span>“My team never stops at just one example when handling a new word or topic. Some European languages are harder than others,” adds Ros. If you’ve ever been to Spain, you know that Spanish is often spoken at blazing-fast speeds and we need to train AI well to handle that.”</span></p>
<p>Galaxy AI’s expansion required novel cross-continent collaboration, but the work soon grew beyond European borders. SRPOL supported the Jordan team’s efforts to teach Galaxy AI Arabic’s myriad of dialects, as well as the Brazil team’s work on Latin American languages.</p>
<p>The importance of language and cultural difference subtleties are all on the radar of SRPOL product developers because they can all be noticed by the target — the end users.</p>
<p>“There are subtle differences between European cultures that impact whether something feels natural to the end user. For example, people in some countries expect to read prices with the euro symbol (€), while others are accustomed to seeing it spelled out, e-u-r-o-s,” says Agata Maria Rozycka, Head of Voice Intelligence Research at SRPOL. “If this cultural nuance is not reflected translated text, the interface might seem less intuitive to a user. Implementing these micro-level insights into interface design can make technology feel more natural across diverse cultures.”</p>
<p>“The team has been remotely communicating and collaborating across different countries for many years, building up numerous effective communication channels,” says Marcin Mrugala, Head of Mobile Quality Assurance at SRPOL. “We were ready to do our part in enabling Galaxy AI to lower language barriers around the world.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-153360" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/07/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-Poland_main3.jpg" alt="" width="1000" height="462" /></p>
<h3><span style="color: #000080"><strong>Technology for Bridging Cultures</strong></span></h3>
<p>Managing and integrating diverse linguistic and cultural insights is a challenging task, but it is essential for Samsung’s vision for Galaxy AI — lowering the barriers that divide people based on language and culture, and enabling them to create deeper connections.</p>
<p><span>“We’re not just building technology of the future, we’re building teams of the future too. Our best practices are designed to refine products based on differences across countries, but we fundamentally believe our similarities far outweigh our differences and our technology can unite cultures,” says Mrugala.</span></p>
<p>“Our goal is to bring people together, to make their lives easier, and to simplify their daily tasks. We’re seeing our families using the Voice Recorder in new ways, and we can now call our friends and different countries and talk with them in their own language. It is magical to see this change in the world and to be part of it. Galaxy AI brought SRPOL people together and now we are bringing together the world,” concludes Rozycka.</p>
]]></content:encoded>
																				</item>
					<item>
				<title>The Learning Curve, Part 6: The Collaborative Path to AI Innovation</title>
				<link>https://news.samsung.com/global/the-learning-curve-part-6-the-collaborative-path-to-ai-innovation</link>
				<pubDate>Fri, 28 Jun 2024 18:00:32 +0000</pubDate>
								<media:content url="https://img.global.news.samsung.com/global/wp-content/uploads/2024/06/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-India-Bangalore_thumb728.jpg" medium="image" />
				<dc:creator><![CDATA[Samsung Newsroom]]></dc:creator>
						<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Galaxy AI]]></category>
		<category><![CDATA[Interpreter]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[Live Translate]]></category>
		<category><![CDATA[Samsung R&D Institute]]></category>
		<category><![CDATA[Samsung R&D Institute India-Bangalore]]></category>
		<category><![CDATA[Text-to-speech]]></category>
		<category><![CDATA[The Learning Curve]]></category>
		<category><![CDATA[Vellore Institute of Technology]]></category>
                <guid isPermaLink="false">https://bit.ly/3znJWmb</guid>
									<description><![CDATA[As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as […]]]></description>
																<content:encoded><![CDATA[<p>As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as Live Translate, Interpreter, Note Assist and Browsing Assist. But what does AI language development involve? Last time, we visited <a href="https://news.samsung.com/global/the-learning-curve-part-5-overcoming-multicultural-and-multilingual-differences">Brazil</a> to learn how teams work across cultures and borders to bring Galaxy AI to more people. This time, we’re in India to discover the value of cooperating with local partners.</p>
<p>Hidden inside the Vellore Institute of Technology in Chennai, India, is a lab filled with futuristic audio equipment. One will find mannequins — known in the industry as head and torso simulators — as well as binaural microphones and hearing devices. They are stored in special chambers treated with an advanced sound absorption system, making this lab the first of its kind in India. Imagine such a facility is used to develop the latest high-end high fidelity (Hi-Fi) equipment.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-153252" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/06/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-India-Bangalore_main1.jpg" alt="" width="1000" height="667" /></p>
<p>This is where the Vellore Institute of Technology collaborates with Samsung to produce and develop data and insights that power the latest AI models for Galaxy AI’s language capabilities. The facility was developed as part of Samsung SEED (Students Ecosystem for Engineered Data) Labs — an initiative that enables university staff, students and interns in India to work on projects requested by Samsung since 2021. This is just one of several university programs funded by Samsung in which students have the opportunity to work on projects with technical experts from the company.</p>
<p>“As a student, I love being able to work on multiple projects with a well-known and respected company such as Samsung,” says Yashika Ilanchezhiyan, a Samsung SEED student. “I’m given the confidence to learn new skills in a practical way and feel like I’m making a real difference in current and future products.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-153253" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/06/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-India-Bangalore_main2.jpg" alt="" width="1000" height="667" /></p>
<p>“This kind of collaboration is a win-win situation,” says Giridhar Jakki, Head of Language AI at Samsung R&D Institute India – Bangalore (SRI-B). “Thanks to our projects with universities, we are able to access additional expertise and custom datasets. Partnering universities receive investment, financial incentives and expert mentorship from Samsung as a result.”</p>
<h3></h3>
<h3><span style="color: #000080"><strong>Lowering Language Barriers</strong></span></h3>
<p><span>SRI-B has collaborated with teams </span><span>around the world to develop AI language models for British, Indian and Australian English</span><span> as well as Thai, Vietnamese and Indonesian. </span>Recently, core engineers from other Samsung Research centers visited Bangalore, India <span>—</span> where the SRI-B team helped ramp up the technology to bring Vietnamese, Thai and Indonesian to Galaxy AI. SRI-B was therefore ideally positioned to develop the Hindi language for Galaxy AI.</p>
<p><span>“Every language has its challenges,</span>”<span> says </span>Jakki<span>.</span> “B<span>ut when you consider the end goal</span> of bringing people <span>the ability to communicate in other languages</span>,<span> it’s worth every ounce of effort. We couldn’t wait to bring Hindi to Galaxy AI</span>.<span>”</span></p>
<p><img loading="lazy" class="alignnone size-full wp-image-153254" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/06/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-India-Bangalore_main3.jpg" alt="" width="1000" height="667" /></p>
<p>Developing the Hindi AI model wasn’t simple. The team had to ensure more than 20 regional dialects, tonal inflections, punctuation and colloquialisms were covered. Additionally, it is common for Hindi speakers to mix English words in their conversations. This required the team to carry out multiple rounds of AI model training with a combination of translated and transliterated data.</p>
<p>“Hindi has a complex phonetic structure that includes retroflex sounds <span>—</span> <span>sounds made by curling the tongue back in the mouth —</span> <span>which</span> are not present in many other languages,” says Jakki. “To build the speech synthesis element of the AI solution, we carefully reviewed data with native linguists <span>to understand all the unique sounds and created a special set of </span>phenomes<span> to </span>support specific dialects <span>of the language</span>.”</p>
<p>Collaborative efforts between Samsung and academic partners were instrumental in developing the AI language model that reflected the cultural nuances of the India’s regions. The Vellore Institute of Technology helped secure almost a million lines of segmented and curated audio data on conversational speech, words and commands. Data was a crucial component for a task as critical as incorporating the fourth most spoken language in the world into Galaxy AI. Working with universities ensured Samsung was using the highest quality data.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-153255" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/06/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-India-Bangalore_main4.jpg" alt="" width="1000" height="667" /></p>
<h3><span style="color: #000080"><strong>Global Connections Deliver Big Impact</strong><strong>s</strong></span></h3>
<p><span>This project perfectly encapsulates Samsung’s philosophy of open collaboration and the </span><span>company’s belief that sharing expertise and perspectives ensure</span><span>s meaningful innovation.</span> In the case of SRI-B, this not only includes working with academia but also sharing insights and best practices with other Samsung research centers around the world.</p>
<p>“I’m extremely proud of what we’ve achieved with the help of our partners,” says Jakki. “AI innovation through collaboration is a big part of what we do. We will continue to better understand, collect and analyze language data so more people can have access to AI tools in the future.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-153256" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/06/Samsung-Mobile-Galaxy-AI-Samsung-RD-Institute-India-Bangalore_main5.jpg" alt="" width="1000" height="667" /></p>
]]></content:encoded>
																				</item>
					<item>
				<title>The Learning Curve, Part 5: Overcoming Multicultural and Multilingual Differences</title>
				<link>https://news.samsung.com/global/the-learning-curve-part-5-overcoming-multicultural-and-multilingual-differences</link>
				<pubDate>Wed, 12 Jun 2024 17:00:49 +0000</pubDate>
								<media:content url="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_Brazil_Thumb728.jpg" medium="image" />
				<dc:creator><![CDATA[Samsung Newsroom]]></dc:creator>
						<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Galaxy AI]]></category>
		<category><![CDATA[Interpreter]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[Live Translate]]></category>
		<category><![CDATA[Samsung R&D Institute]]></category>
		<category><![CDATA[Samsung R&D Institute Brazil]]></category>
		<category><![CDATA[Text-to-speech]]></category>
		<category><![CDATA[The Learning Curve]]></category>
                <guid isPermaLink="false">https://bit.ly/45jblBF</guid>
									<description><![CDATA[As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as […]]]></description>
																<content:encoded><![CDATA[<p>As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as Live Translate, Interpreter, Note Assist and Browsing Assist. But what does AI language development involve? Last time, we visited <a href="https://news.samsung.com/global/the-learning-curve-part-4-a-new-ai-model-and-an-evolving-language" target="_blank" rel="noopener">China</a> to learn about the importance of partnering with other leaders in AI. This time, we’re in Brazil to explore how teams work across cultures and borders to bring Galaxy AI to more people.</p>
<p>A diverse country with more than 203 million people embodying a wide range of cultures and traditions, Brazil uses Brazilian Portuguese as its official language. Meanwhile, 22 neighboring countries use Latin American Spanish.</p>
<p>Although Brazilian Portuguese and Latin American Spanish are widely spoken, intricate variations in both languages presented various challenges when teaching Galaxy AI to discern and distinguish regional differences. That’s why Samsung R&D Institute Brazil (SRBR) collaborated with Samsung experts from Mexico — as well as third-party partners such as the science and technology institutes SiDi and Sidia — to assemble a multidisciplinary and highly skilled team that could tackle the task.</p>
<h3><span style="color: #000080"><strong>Lower Barriers, Higher Understanding</strong></span></h3>
<p>The team used thousands of sources and a combination of machine learning and language processing tools to improve the AI model’s recognition of speech, written texts and regional variations. But local jargon and names of famous figures — including sports teams, celebrities and bands — vary widely between regions. Also, the same meaning can be expressed in many different words. While language models need localized data to gain a comprehensive understanding of the different languages to be translated, such variations inevitably present obstacles.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152805" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_Brazil_main1.jpg" alt="" width="1000" height="563" /></p>
<p>For example, swimming pool is “alberca” in Mexico — but it is “pileta” in Argentina, Paraguay and Uruguay. Meanwhile, in Colombia, Bolivia and Venezuela, swimming pool is “piscina”, which is also used in Brazil but with a slight tonal difference. And while Colombians might say “chévere” to refer to something cool, Mexicans instead say “padre.”</p>
<p>These differences represent huge challenges for AI language understanding and learning, but the team overcame them by building larger language models, refining processing tools — and collaborating across borders and time zones.</p>
<p>“We had to consider local slang and different ways of speaking before adapting and testing the model accordingly, which required close collaboration between the SRBR quality assurance (QA) team and development teams,” says Mateus Pedroso, Senior Manager and Head of Software Quality Lab at SRBR. “Since SRBR is located three hours ahead of the QA team in Mexico and 12 hours behind the management team in Korea, we had to create new communication channels and processes to align results and share progress. This multicultural collaboration generated a fiesta of ideas and solutions for Galaxy AI.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152806" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_Brazil_main2.jpg" alt="" width="1000" height="667" /></p>
<h3><span style="color: #000080"><strong>Communicating Success</strong></span></h3>
<p>Samsung’s philosophy of open collaboration came to life during this regional project as it was an iterative process that leveraged evolving technology on a global scale. To overcome linguistic and cultural barriers, the SRBR team needed to collect and manage massive amounts of data — continually refining and improving upon audio and text sources.</p>
<p>The teams carved out key areas of responsibility to ensure everyone could benefit from the collective skill sets across the company’s Latin American offices. The SRBR development team served as the intermediate stakeholder of the project, receiving directions from Samsung’s headquarters and developing new updates to improve the AI model while carrying out tests for numerous use cases.</p>
<p>“The testing phase required extensive communication and collaboration with QA teams to optimize the user experience, and each adjustment needed further testing and review,” says Leandro Flores de Moura, Software Development Manager at SiDi. “The success of Galaxy AI’s language capabilities is built on communication and collaboration as much as it is on technical expertise” adds Nathan Castro, QA Test Developer at SiDi.</p>
<h3><span style="color: #000080"><strong>A Roadmap for Culture</strong></span></h3>
<p>What makes Galaxy AI particularly interesting for everyone involved is the fact that this wasn’t merely a language project. To them, language is a cultural guide that provides valuable insight into people’s heritage and identity.</p>
<p>“For SiDi’s QA team, this was an endeavor that will change the world by enabling cultures to come together and overcome the difficulty of communicating in different languages,” adds Estefanía Castro Suárez, Test Developer at SiDi. “Knowing we were part of this fills us with pride and motivation.”</p>
<p>“The way the SRBR team collaborated exemplifies what Galaxy AI sets out to achieve — making the world a smaller place through communicating, sharing and interacting with people, even those who speak different languages,” concludes Pedroso. “This capability will only grow as more languages come on board with Galaxy AI.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152807" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_Brazil_main3.jpg" alt="" width="1000" height="667" /></p>
]]></content:encoded>
																				</item>
					<item>
				<title>The Learning Curve, Part 4: A New AI Model and an Evolving Language</title>
				<link>https://news.samsung.com/global/the-learning-curve-part-4-a-new-ai-model-and-an-evolving-language</link>
				<pubDate>Tue, 04 Jun 2024 17:00:35 +0000</pubDate>
								<media:content url="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_China_Thumb728.jpg" medium="image" />
				<dc:creator><![CDATA[Samsung Newsroom]]></dc:creator>
						<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Galaxy AI]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[Live Translate]]></category>
		<category><![CDATA[Samsung Electronics Hong Kong]]></category>
		<category><![CDATA[Samsung R&D Institute]]></category>
		<category><![CDATA[Samsung R&D Institute China]]></category>
		<category><![CDATA[The Learning Curve]]></category>
                <guid isPermaLink="false">https://bit.ly/4c1YkhU</guid>
									<description><![CDATA[As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as […]]]></description>
																<content:encoded><![CDATA[<p><img loading="lazy" class="alignnone size-full wp-image-152471" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_China_main1.jpg" alt="" width="1000" height="667" /></p>
<p>As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as Live Translate, Interpreter, Note Assist and Browsing Assist. But what does AI language development involve? Last time, we visited <a href="https://news.samsung.com/global/the-learning-curve-part-3-taking-ai-data-from-good-to-great" target="_blank" rel="noopener">Vietnam</a> to learn about preparing the data that is used to train AI models. This time, we’re seeing how teams made Galaxy AI a unique offering for both the Chinese mainland and Hong Kong.</p>
<p>The rapid growth in AI tools that use large language models (LLM) has been seen worldwide, and China is no exception. With Baidu’s ERNIE Bot and Meitu’s MiracleVision emerging as popular choices in China, Samsung R&D Institute China partnered with both companies to help build Galaxy AI features for the country.</p>
<p>Samsung R&D Institute China in Guangzhou (SRC-G) and Beijing (SRC-B) worked to ensure Mandarin speakers in China had the same Galaxy AI experience as other users around the world, despite the back-end technology looking very different. The team took advantage of the dedicated resources of Chinese dialects from third-party partners and built a unique Galaxy AI solution for China.</p>
<p>“We have the advantage of blending global best practices with China’s local practices, as well as creating new features and constantly improving them through daily communication with Chinese consumers,” says Hairong Zhang, Software Innovation Group Leader at SRC-G. “With rich development experience from the Galaxy S24, I’m proud of how our team cooperated with local Chinese AI companies such as Baidu and Meitu to provide a solution that resonates in China.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152472" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_China_main2.jpg" alt="" width="1000" height="656" /></p>
<p>At the beginning, the teams had to acclimate to each other’s working styles and iron out the initial kinks of information asymmetry. Daijun Zhang, Head of SRC-B, established a task force to ensure the project followed the development schedule and moved quickly toward its goals.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152473" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_China_main3.jpg" alt="" width="1000" height="667" /></p>
<p>Thanks to the Beijing team’s experience in generating large-scale models and successful collaboration with third-party partners, all the generative AI features were successfully launched in China. The result is a solution that has local relevance and market-specific features such as Touch to Search.</p>
<h3><span style="color: #000080"><strong>Expanding on Chinese To Develop for the Cantonese Dialect</strong></span></h3>
<p>Chinese for mainland China (Mandarin) arrived on Galaxy AI with the launch of the Galaxy S24 in January 2024. But the job for Samsung R&D Institute China was far from finished. The team was also tasked with developing the AI model for Chinese in Hong Kong (Cantonese), a dialect that builds on the work already carried out for Mandarin but brings an entirely new set of language features to address.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152474" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_China_main4.jpg" alt="" width="1000" height="667" /></p>
<p>In developing for Cantonese, the China R&D team faced major cultural challenges that it needed to respond to in order to fully support localization for the market. The first cultural phenomenon is the two sets of systems for writing and speech. Hong Kong locals use grammar and expressions similar to Mandarin when writing but adopt a completely different colloquial grammar when communicating daily. Also, Cantonese has nine tones for pronunciation, whereas Mandarin has four.</p>
<p>Another cultural phenomenon is that the Cantonese dialect itself develops with the times. Add to that the fact that people often blend Cantonese and English into conversations, and it’s clear to see why it was complicated to create test cases and validate language packs.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152478" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_China_main5.jpg" alt="" width="1000" height="571" /></p>
<p>“Cantonese is a very unique dialect that varies in different Cantonese-speaking regions,” says Jing Li, who leads the operation for testing the Cantonese AI solution. “Some of the slang, phrases, vocabulary and even the tones are varied from place to place. Therefore, we conducted a large amount of work in verifying the Hong Kong-specific data, as well as proofreading tens of thousands of relevant test cases.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152475" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_China_main6.jpg" alt="" width="1000" height="667" /></p>
<p>With these complexities in mind, SRC-G and SRC-B worked together to support a deep code mix using a mixture of Cantonese and English for speech recognition, simultaneously supporting both written and spoken expressions in machine translation and reflecting current pronunciations in speech synthesis.</p>
<h3><span style="color: #000080"><strong>Cultural Impact of Communication</strong></span></h3>
<p>When Galaxy AI launched the Chinese (Hong Kong) language option, the customer feedback showed that the hard work of the Samsung R&D team was justified.</p>
<p>For both the Chinese mainland and Hong Kong, Samsung’s Galaxy AI activities show the importance of a global brand having a local presence and expertise, as well as the power of open collaboration with other organizations. In Hong Kong, Cantonese is a key part of the cultural identity of those who live there. That’s why it was so important for the team to get the AI language model right.</p>
<p>“Language and communication are crucial in every region and in all walks of life,” says Henry Wat, Heads of Engineering Group at Samsung Electronics Hong Kong. “No matter the language, any tool that helps people communicate is invaluable. I believe our work is meaningful.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152476" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/06/Learning-Curve_China_main7.jpg" alt="" width="1000" height="750" /></p>
<p>In the next episode of The Learning Curve, we will head to Brazil to see how a team works across cultures and borders to bring Galaxy AI to more people.</p>
]]></content:encoded>
																				</item>
					<item>
				<title>The Learning Curve, Part 3: Taking AI Data From Good to Great</title>
				<link>https://news.samsung.com/global/the-learning-curve-part-3-taking-ai-data-from-good-to-great</link>
				<pubDate>Thu, 23 May 2024 17:00:16 +0000</pubDate>
								<media:content url="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve_Part-3_Thumb728.jpg" medium="image" />
				<dc:creator><![CDATA[Samsung Newsroom]]></dc:creator>
						<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Automatic speech recognition]]></category>
		<category><![CDATA[Galaxy AI]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[Live Translate]]></category>
		<category><![CDATA[Samsung R&D Institute]]></category>
		<category><![CDATA[Samsung R&D Institute Vietnam]]></category>
		<category><![CDATA[Text-to-speech]]></category>
		<category><![CDATA[The Learning Curve]]></category>
                <guid isPermaLink="false">https://bit.ly/3yxPn1t</guid>
									<description><![CDATA[Samsung is pioneering premium mobile AI experiences. To learn how Galaxy AI is maximizing the potential of its users, we are visiting Samsung Research centers around the world. Now supporting 16 languages, Galaxy AI is enabling more people to expand their language capabilities, even when offline, thanks to on-device translation in features such as Live […]]]></description>
																<content:encoded><![CDATA[<p><img loading="lazy" class="alignnone size-full wp-image-152111" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve_Part-3_main1.jpg" alt="" width="1000" height="667" /></p>
<p>Samsung is pioneering premium mobile AI experiences. To learn how Galaxy AI is maximizing the potential of its users, we are visiting Samsung Research centers around the world. Now supporting 16 languages, Galaxy AI is enabling more people to expand their language capabilities, even when offline, thanks to on-device translation in features such as Live Translate, Interpreter, Note Assist and Browsing Assist. We recently visited <a href="https://news.samsung.com/global/the-learning-curve-part-2-how-to-build-an-ai-for-diverse-dialects" target="_blank" rel="noopener">Jordan</a> to learn the complexities of developing an AI model for Arabic, a language with many dialects. This time, we’re going to Vietnam to explore how data is prepared to train AI models.</p>
<p>What is the difference between a ghost, grave and mother in Vietnamese? For a language spoken by 97 million people worldwide, very little. Each word translates to “ma,” “mả” and “má,” respectively — and can only be distinguished by tone. This illustrates how difficult it can be for AI models to learn a language, considering they cannot recognize firsthand the context and emotions of conversations nor the intentions of those speaking.</p>
<p>Samsung R&D Institute Vietnam (SRV) used finely refined data to help its AI model properly recognize even the most subtle differences in language.</p>
<p>The quality of data used directly affects the accuracy of automatic speech recognition (ASR), neural machine translation (NMT) and text-to-speech (TTS) — processes that help Galaxy AI features such as Live Translate, Interpreter, Chat Assist and Browsing Assist break down language barriers.</p>
<h3><span style="color: #000080"><strong>A Typhoon of Challenges</strong></span></h3>
<p>“Vietnamese is a complex and diverse language with rich expressions, many of which are challenging to capture,” says Ngô Hồng Thái, NMT lead at SRV. Of the 16 languages that Galaxy AI supports, Vietnamese was particularly difficult to develop.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152112" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve_Part-3_main2.jpg" alt="" width="1000" height="667" /></p>
<p>“Personally, creating an AI model for Vietnamese was more daunting than our typhoons!” he adds before explaining the hurdles faced during the development process.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152113" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve_Part-3_main3.jpg" alt="" width="1000" height="571" /></p>
<p><span>Vietnamese</span> <span>is a tonal language </span><span>with six distinct tones. As evident in the “ma” example above, small nuances in vocalization can drastically alter the meanings of words. Therefore, a </span>meticulous and <span>detailed approach was necessary.</span></p>
<p>“When similar sounding words are broken down, one word consists of several short segments, or ‘frame sets’,” says Bui Ngoc Tung, ASR lead at SRV. “The AI model differentiates between the short audio frames of around 20 milliseconds to recognize what words correspond to a certain set of consecutive frames. As such, it is critical to put great effort into the early stages of the AI learning process.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152114" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve_Part-3_main4.jpg" alt="" width="1000" height="667" /></p>
<p>Furthermore, homophones and homonyms are common in Vietnamese. People can normally rely on context and nonverbal elements in conversations to differentiate between words that sound the same or are written the same but have different meanings. However, AI models need to be taught to accurately identify and differentiate between tones and similar words.</p>
<p>“This isn’t a straightforward task,” Thái explains. “Apart from the amount, the data needs to be accurate to ensure it is capable of recognizing the linguistic nuances that exist in Vietnamese.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152115" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve_Part-3_main5.jpg" alt="" width="1000" height="667" /></p>
<h3><span style="color: #000080"><strong>Rigorous Pr</strong><strong>eparation</strong></span></h3>
<p>The data refinement process consists of three steps. First, the audio and text used to train the AI model must be reviewed and corrected. Then, this dataset goes through random checks for overall quality. Finally, the dataset is normalized and cleaned before use in training.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152135" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve_Part-3_main06.jpg" alt="" width="1000" height="615" /></p>
<p>“We thoroughly performed a series of tests to check the accuracy of our dataset,” says Nguyen Manh Duy, TTS lead at SRV who oversees database creation. “We faced a number of unexpected problems including misspelled words in scripts and background noise or incorrect pronunciation during audio recordings. We spent significant time refining and improving our training data.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152116" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve_Part-3_main7.jpg" alt="" width="1000" height="667" /></p>
<p>A vital part of the data refinement process and the journey of taking AI data from good to great is the work of the Software Quality Engineering (SQE) team. The team plays an important role in testing and improving AI language data quality and they work closely with the AI language development project team to make it happen.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152319" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/05/Learning-Curve_Part-3_main10.jpg" alt="" width="1000" height="667" /></p>
<p>In addition to the<span> unique linguistic challenges</span> in Vietnamese<span>, </span>there is a <span>lack of universally accessible data compared to more widely spoken languages. “This is another reason why the data refinement stage is so important,”</span> he<span> adds. “Since we had limited sources, every piece of </span>data<span> had to be fully reliable. There was no margin for error</span>.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152117" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve_Part-3_main8.jpg" alt="" width="1000" height="679" /></p>
<p>Moreover, the AI model for Vietnamese must consider both tonal and regional differences. To improve the AI model’s accuracy, the team collected vast amounts of data with Vietnam’s northern, central and southern accents — resulting in an enormous amount of information to refine and verify.</p>
<h3><span style="color: #000080"><strong>Continued Improvement</strong></span></h3>
<p>Developers at SRV completed the project after months of hard work, and Vietnamese became one of the first languages to be supported by Galaxy AI. Despite this success, the team is ceaselessly working to improve the Vietnamese Galaxy AI experience.</p>
<p>“We’re continuing to enhance the AI model by incorporating user feedback about the relevance of words and phrases in Galaxy AI,” says Tran Tuan Minh, leader of the AI language development project at SRV. “We have just taken our first steps into a more open world  —  and we have so much more to explore together.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-152318" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/05/Learning-Curve_Part-3_main09.jpg" alt="" width="1000" height="667" /></p>
<p>In the next episode of The Learning Curve, we will head to China to dig into how AI models are trained and fine-tuned.</p>
]]></content:encoded>
																				</item>
					<item>
				<title>The Learning Curve, Part 2: How to Build an AI for Diverse Dialects</title>
				<link>https://news.samsung.com/global/the-learning-curve-part-2-how-to-build-an-ai-for-diverse-dialects</link>
				<pubDate>Thu, 16 May 2024 17:00:31 +0000</pubDate>
								<media:content url="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve-Part-2_AI_Thumb728.jpg" medium="image" />
				<dc:creator><![CDATA[Samsung Newsroom]]></dc:creator>
						<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Automatic speech recognition]]></category>
		<category><![CDATA[Galaxy AI]]></category>
		<category><![CDATA[language]]></category>
		<category><![CDATA[Live Translate]]></category>
		<category><![CDATA[Samsung R&D Institute]]></category>
		<category><![CDATA[Samsung R&D Institute Jordan]]></category>
		<category><![CDATA[Text-to-speech]]></category>
		<category><![CDATA[The Learning Curve]]></category>
                <guid isPermaLink="false">https://bit.ly/3V4yRyR</guid>
									<description><![CDATA[Galaxy AI now supports 16 languages, helping more people to lower language barriers with real-time and on-device translation. Samsung opened the door to a new era of mobile AI, so we are visiting Samsung Research centers all over the world to learn how Galaxy AI came to life and what it took to overcome the […]]]></description>
																<content:encoded><![CDATA[<p>Galaxy AI now supports 16 languages, helping more people to lower language barriers with real-time and on-device translation. Samsung opened the door to a new era of mobile AI, so we are visiting Samsung Research centers all over the world to learn how Galaxy AI came to life and what it took to overcome the challenges of AI development. While part one of the series examines the task of determining what data is needed, this installment looks at the complex task of accounting for dialects.</p>
<p>Teaching a language to an AI model is a complex process, but what if it isn’t a singular language, but a collection of diverse dialects? That was the challenge faced by the team at Samsung R&D Institute Jordan (SRJO). While Arabic was added as a language option for Galaxy AI features such as Live Translate, the team had to cater to the various Arabic dialects that span the Middle East and North Africa, with each varying in pronunciation, vocabulary and grammar.</p>
<p>Arabic is one of the top six most widely spoken languages around the world, used daily by more than 400 million people.<sup>1</sup> The language is categorized into two forms: Fus’ha (Modern Standard Arabic) and Ammiya (the dialects of Arabic). Fus’ha is typically used in public and official events, as well as in news broadcasts, while Ammiya is more commonly used for day-to-day conversations. Over 20 countries use Arabic, and there are currently around 30 dialects in the region.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151951" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve-Part-2_AI_main1.jpg" alt="" width="1000" height="667" /></p>
<h3><span style="color: #000080"><strong>Unwritten Rules</strong></span></h3>
<p>Recognizing the variation presented by these dialects, the team at SRJO employed a range of techniques to discern and process the unique linguistic features inherent in each. This approach was crucial in ensuring that Galaxy AI could understand and respond in a way that accurately reflects the regional nuances.</p>
<p>“Unlike other languages, the pronunciation of the object in Arabic varies depending on the subject and verb in the sentence,” says Mohammad Hamdan, project leader of the Arabic language development team. “Our goal is to develop a model that understands all these dialects and can answer in standard Arabic.”</p>
<p>TTS is the component of Galaxy AI’s Live Translate feature that lets users interact with speakers of different languages by translating spoken words into written text, and then vocally reproducing them. The TTS team faced a unique challenge, caused by the quirk of working with Arabic.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151952" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve-Part-2_AI_main2.jpg" alt="" width="1000" height="667" /></p>
<p>Arabic uses diacritics, which are guides for the pronunciation of words in some contexts, such as religious texts, poetry and books for language learners. Diacritics are widely understood by native speakers but absent in everyday writing. This makes it difficult for a machine to convert raw text into phonemes, the basic units of sound that are the building blocks of speech.</p>
<p>“There is a shortage of high-quality and reliable datasets that accurately represent how diacritics are correctly used,” explains Haweeleh. “We had to design a neural model that can predict and restore those missing diacritics with high accuracy.”</p>
<p>Neural models work similarly to human brains. To predict diacritics, a model needs to study lots of Arabic text, learn the language’s rules and understand how words are used in different contexts. For instance, the pronunciation of a word can vary greatly depending on the action or gender it describes. Extensive training from the team was the key to enhancing the Arabic TTS model’s accuracy.</p>
<h3><span style="color: #000080"><strong>Enhancing Understanding</strong></span></h3>
<p>The SRJO team also had to collect diverse audio recordings of the dialects from various sources, which had to be transcribed, focusing on unique sounds, words and phrases. “We assembled a team of native speakers in the dialects who were well-versed in the nuances and variations,” says Ayah Hasan, whose team was responsible for database creation. “They listened to the recordings and manually converted the spoken words into text.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151953" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve-Part-2_AI_main3.jpg" alt="" width="1000" height="667" /></p>
<p>This work was crucial for enhancing the Automatic Speech Recognition (ASR) process so that Galaxy AI could handle the rich tapestry of Arabic dialects. ASR is pivotal in enabling Galaxy AI’s real-time understanding and response capabilities.</p>
<p>“Building an ASR system that supports multiple dialects in a single model is a complex undertaking,” says Mohammad Hamdan, ASR lead for the project. “It demands a thorough understanding of the language’s intricacies, careful data selection and advanced modeling techniques.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151954" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve-Part-2_AI_main4.jpg" alt="" width="1000" height="667" /></p>
<h3><span style="color: #000080"><strong>The Culmination of Innovation</strong></span></h3>
<p>After months of planning, building and testing, the team was ready to release Arabic as a language option for Galaxy AI, enabling many more people to communicate across borders. This single team has made Galaxy AI services accessible to Arabic speakers, lowering the language and cultural barriers between them and people all over the world. In doing so, they have established new best practices that can be rolled out globally. This success is only the beginning: the team continues to refine their models and enhance the quality of Galaxy AI’s language capabilities.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151955" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve-Part-2_AI_main5.jpg" alt="" width="1000" height="667" /></p>
<p>In the next episode, we go to Vietnam to see how the team makes language data better. Plus, what does it take to train an effective AI model?</p>
<p>Arabic is just one part of the languages and dialects newly supported by Galaxy AI and available for download from the Settings app. Galaxy AI’s language features such as Live Translate and Interpreter are available on Galaxy devices running Samsung’s One UI 6.1 update.<sup>2</sup></p>
<div class="youtube_wrap"><iframe loading="lazy" src="https://www.youtube.com/embed/KOU1HXipelo?rel=0" width="300" height="150" frameborder="0" allowfullscreen="allowfullscreen"><span data-mce-type="bookmark" style="width: 0px;overflow: hidden;line-height: 0" class="mce_SELRES_start">﻿</span><span style="width: 0px;overflow: hidden;line-height: 0" data-mce-type="bookmark" class="mce_SELRES_start"></span></iframe></div>
<p><span style="font-size: small"><em><sup>1</sup> UNESCO, World Arabic Language Day 2023, <a href="https://www.unesco.org/en/world-arabic-language-day" target="_blank" rel="noopener">https://www.unesco.org/en/world-arabic-language-day<br />
</a><sup>2</sup> One UI 6.1 was first released on Galaxy S24 series devices with a wider roll out to other Galaxy devices including S23 series, S23 FE, S22 series, S21 series, Z Fold5, Z Fold4, Z Fold3, Z Flip5, Z Flip4, Z Flip3, Tab S9 series and Tab S8 series</em></span></p>
]]></content:encoded>
																				</item>
					<item>
				<title>The Learning Curve, Part 1: Why Teaching AI New Languages Begins With Data</title>
				<link>https://news.samsung.com/global/the-learning-curve-part-1-why-teaching-ai-new-languages-begins-with-data</link>
				<pubDate>Fri, 10 May 2024 14:50:29 +0000</pubDate>
								<media:content url="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve-Part-1_AI_Thumb728.jpg" medium="image" />
				<dc:creator><![CDATA[Samsung Newsroom]]></dc:creator>
						<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Automatic speech recognition]]></category>
		<category><![CDATA[Galaxy AI]]></category>
		<category><![CDATA[Live Translate]]></category>
		<category><![CDATA[Neural Machine Translation]]></category>
		<category><![CDATA[Samsung R&D Institute]]></category>
		<category><![CDATA[Samsung R&D Institute Indonesia]]></category>
		<category><![CDATA[Text-to-speech]]></category>
		<category><![CDATA[The Learning Curve]]></category>
                <guid isPermaLink="false">https://bit.ly/3QFV6sh</guid>
									<description><![CDATA[As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as […]]]></description>
																<content:encoded><![CDATA[<p><img loading="lazy" class="alignnone size-full wp-image-151822" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/05/Learning-Curve-Part-1_AI_main1.jpg" alt="" width="1000" height="667" /></p>
<p>As Samsung continues to pioneer premium mobile AI experiences, we visit Samsung Research centers around the world to learn how Galaxy AI is enabling more users to maximize their potential. Galaxy AI now supports 16 languages, so more people can expand their language capabilities, even when offline, thanks to on-device translation in features such as Live Translate, Interpreter, Note Assist and Browsing Assist. But what does AI language development involve? This series examines the challenges of working with mobile AI and how we overcame them. First up, we head to Indonesia to learn where one begins teaching AI to speak a new language.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151826" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/05/Learning-Curve-Part-1_AI_main2.jpg" alt="" width="1000" height="667" /></p>
<p>The first step is establishing targets, according to the team at Samsung R&D Institute Indonesia (SRIN). “Great AI begins with good quality and relevant data. Each language demands a different way to process this, so we dive deep to understand the linguistic needs and the unique conditions of our country,” says Junaidillah Fadlil, Head of AI at SRIN, whose team recently added Bahasa Indonesia (Indonesian language) support to Galaxy AI. “Local language development has to be led by insight and science, so every process for adding languages to Galaxy AI starts with us planning what information we need and can legally and ethically obtain.”</p>
<p>Galaxy AI features such as Live Translate perform three core processes: automatic speech recognition (ASR), neural machine translation (NMT) and text-to-speech (TTS). Each process needs a distinct set of information.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151827" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/05/Learning-Curve-Part-1_AI_main3.jpg" alt="" width="1000" height="667" /></p>
<p>ASR, for instance, needs extensive recordings of speech in numerous environments, each paired with an accurate text transcription. Varying background noise levels help account for different environments. “It’s not enough just to add noises to recordings,” explains Muchlisin Adi Saputra, the team’s ASR lead. “In addition to the language data we obtained from authorized <span>third-party</span> partners, we must go out into coffee shops or working environments to record our own voices. This allows us to authentically capture unique sounds from real life, like people calling out or the clattering of keyboards.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151828" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/05/Learning-Curve-Part-1_AI_main4.jpg" alt="" width="1000" height="667" /></p>
<p>The ever-changing nature of languages must also be considered. Saputra adds, “We need to keep up to date with the latest slang and how it is used, and mostly we find it on social media!”</p>
<p>Next, NMT requires translation training data. “Translating Bahasa Indonesia is challenging,” says Muhamad Faisal, the team’s NMT lead. “Its extensive use of contextual and implicit meanings relies on social and situational cues, so we need numerous translated texts that the AI could reference for new words, foreign words, proper nouns and idioms – any information that helps AI understand the context and rules of communication.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151846" src="https://img.global.news.samsung.com/global/wp-content/uploads/2025/05/Learning-Curve-Part-1_AI_main8.jpg" alt="" width="1000" height="666" /></p>
<p>TTS then requires recordings that cover a range of voices and tones, with additional context on how parts of words sound in different circumstances. “Good voice recordings could do half the job and cover all the required phonemes (units of sound in speech) for the AI model,” adds Harits Abdurrohman, TTS lead. “If a voice actor did a great job in the earlier phase, the focus shifts to refining the AI model to clearly pronounce specific words.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151829" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/05/Learning-Curve-Part-1_AI_main5.jpg" alt="" width="1000" height="667" /></p>
<h3><span style="color: #000080"><strong>Stronger Together</strong></span></h3>
<p>It takes vast resources to plan for much data, and SRIN worked closely with linguistics experts. “This challenge requires creativity, resourcefulness and expertise in both Bahasa Indonesia and machine learning,” Fadlil reflects. “Samsung’s philosophy of open collaboration played a big part in getting the job done, as did our scale of operations and history of AI development.”</p>
<p>Working with other Samsung Research centers around the world, the SRIN team was able to quickly adopt best practices and overcome the complexities of establishing data targets. Furthermore, collaboration was good for advancing not only technology but also culture. When the SRIN team joined their counterparts in Bangalore, India, they observed the local fasting customs, creating deeper connections and expanding their understanding of different cultures.</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151830" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/05/Learning-Curve-Part-1_AI_main6.jpg" alt="" width="1000" height="667" /></p>
<p>For the team, Galaxy AI’s language expansion project took on a new significance. “We are particularly proud of our achievements here as this was our first AI project, and it won’t be our last as we continue to refine our models and improve the quality of output,” Fadlil concludes. “This expansion not only reflects our values of openness but also respects and incorporates our cultural identities through language.”</p>
<p><img loading="lazy" class="alignnone size-full wp-image-151831" src="https://img.global.news.samsung.com/global/wp-content/uploads/2024/05/Learning-Curve-Part-1_AI_main7.jpg" alt="" width="1000" height="667" /></p>
<p>In the next episode of The Learning Curve, we will head to Samsung R&D Institute Jordan to speak to the team who led Galaxy AI’s Arabic language project. Tune in to learn about the complexities of building and training an AI model for a language with diverse dialects.</p>
]]></content:encoded>
																				</item>
			</channel>
</rss>