Foxconn Unveils First Large Language Model 'FoxBrain'

Hon Hai Research Institute has introduced Taiwanâs first traditional Chinese large language model (LLM), known as FoxBrain, representing a significant evolution in Taiwanâs AI and technology innovation.
The new model, which was developed using a more efficient and lower cost training method that took just four weeks, is designed to enhance manufacturing, supply chain management and intelligent decision making processes.
The institute, operating under Hon Hai Technology Group (Foxconn), says the AI system demonstrates strong reasoning capabilities and is optimised for local language patterns.
It was initially developed for applications used in Foxconnâs internal systems including data analysis, decision-making, document collaboration, mathematics, coding and problem-solving.
The company now plans to make FoxBrain open-source and shared publicly in the future.
Dr Yung-Hui Li, Director of the AI Research Center at Hon Hai Research Institute, says: âThrough carefully designed training methods and resource optimisation, we have successfully built a local AI model with powerful reasoning capabilities.â
Foxconnâs AI strategy and Nvidiaâs role
FoxBrain is trained using 120 Nvidia H100 GPUs, which are connected via Nvidiaâs Quantum-2 InfiniBand networking technology. This system facilitates high-speed data transfer, a crucial factor in AI model development.
During training, Nvidia supported Foxconn through its Taipei-1 Supercomputer facility, providing both computational power and technical guidance.
The model is built using Nvidiaâs NeMo framework, a toolset designed to develop and customise AI models efficiently.
FoxBrain adopts the Llama 3.1 architecture developed by Meta and integrates 70 billion parametersâvalues adjusted by the AI as it learns. Foxconn states that its model surpasses Llama-3-Taiwan-70B, a similarly sized traditional Chinese AI model, in multiple performance categories.
Testing indicates that FoxBrain excels in mathematical reasoning compared to the base Meta Llama 3.1 model. Foxconn also claims it outperforms Taiwan Llama, which it describes as the leading Traditional Chinese language model available.
Mathematical reasoning and model capabilities
FoxBrain demonstrates strength in logical reasoning and mathematics, as measured by the TMMLU+ benchmark. This benchmark assesses AI performance across various knowledge domains.
To enhance training, Foxconn applied data augmentation techniques, generating 98 billion tokens of pre-training data across 24 topics. In AI models, tokens represent units of text processed by the system.
FoxBrainâs context window is 128,000 tokens, allowing it to retain extensive conversational history or document content, offering broader context awareness than models with smaller windows.
FoxBrainâs open-source potential and future applications
Foxconn acknowledges that while a performance gap remains between FoxBrain and DeepSeekâs distillation modelâanother AI system focused on knowledge transferâits model approaches âworld-leading standardsâ.
The development involved multiple stages, including data collection, cleaning, augmentation, continual pre-training, supervised fine-tuning and reinforcement learning from AI feedback (RLAIF). Additionally, Foxconn implemented a technique called âAdaptive Reasoning Reflectionâ to enhance the modelâs capabilities.
- Developed proprietary data augmentation and quality assessment techniques for 24 topic categories
- Trained the model using 120 Nvidia H100 GPUs over a total of 2,688 GPU days
- Implemented a multi-node parallel training framework to ensure optimal performance and system stability
- Introduced an Adaptive Reasoning Reflection method to enhance the model's autonomous reasoning capabilities
Although initially developed for internal use, Foxconn plans to collaborate with industry partners to expand FoxBrain’s applications in manufacturing, supply chain management and business decision-making.
The company will showcase FoxBrain at the Nvidia GTC 2025 conference on 20 March in a session titled ‘From Open Source to Frontier AI: Build, Customise and Extend Foundation Models.’
"In recent months, the deepening of reasoning capabilities and the efficient use of GPUs have gradually become the mainstream development in the field of AI. Our FoxBrain model adopted a very efficient training strategy, focusing on optimising the training process rather than blindly accumulating computing power,” says Dr. Yung-Hui Li.
“Through carefully designed training methods and resource optimisation, we have successfully built a local AI model with powerful reasoning capabilities."
Explore the latest edition of AI Magazine and be part of the conversation at our global conference series, Tech & AI LIVE.
Discover all our upcoming events and secure your tickets today.
AI Magazine is a BizClik brand


