-
SEA-LION: Southeast Asian Languages in One Network
Paper • 2504.05747 • Published -
Do Large Language Models Speak All Languages Equally? A Comparative Study in Low-Resource Settings
Paper • 2408.02237 • Published -
A Three-Pronged Approach to Cross-Lingual Adaptation with Multilingual LLMs
Paper • 2406.17377 • Published -
Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts
Paper • 2306.11372 • Published
Collections
Discover the best community collections!
Collections including paper arxiv:2401.01055
-
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 55 -
Improving Text Embeddings with Large Language Models
Paper • 2401.00368 • Published • 82 -
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Paper • 2403.13447 • Published • 19 -
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Paper • 2403.05530 • Published • 66
-
The Impact of Reasoning Step Length on Large Language Models
Paper • 2401.04925 • Published • 18 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 55 -
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Paper • 2401.05033 • Published • 18 -
Towards Conversational Diagnostic AI
Paper • 2401.05654 • Published • 20
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper • 2401.02038 • Published • 65 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 188 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 55 -
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Paper • 2401.01325 • Published • 27
-
RakutenAI-7B: Extending Large Language Models for Japanese
Paper • 2403.15484 • Published • 15 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 55 -
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Paper • 2404.04167 • Published • 14 -
abhinand/malayalam-llama-7b-instruct-v0.1
Text Generation • Updated • 6 • 13
-
A Simple Framework to Accelerate Multilingual Language Model for Monolingual Text Generation
Paper • 2401.10660 • Published • 2 -
PersianMind: A Cross-Lingual Persian-English Large Language Model
Paper • 2401.06466 • Published • 5 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 55 -
MaLA-500: Massive Language Adaptation of Large Language Models
Paper • 2401.13303 • Published • 12
-
SEA-LION: Southeast Asian Languages in One Network
Paper • 2504.05747 • Published -
Do Large Language Models Speak All Languages Equally? A Comparative Study in Low-Resource Settings
Paper • 2408.02237 • Published -
A Three-Pronged Approach to Cross-Lingual Adaptation with Multilingual LLMs
Paper • 2406.17377 • Published -
Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts
Paper • 2306.11372 • Published
-
RakutenAI-7B: Extending Large Language Models for Japanese
Paper • 2403.15484 • Published • 15 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 55 -
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Paper • 2404.04167 • Published • 14 -
abhinand/malayalam-llama-7b-instruct-v0.1
Text Generation • Updated • 6 • 13
-
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 55 -
Improving Text Embeddings with Large Language Models
Paper • 2401.00368 • Published • 82 -
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Paper • 2403.13447 • Published • 19 -
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Paper • 2403.05530 • Published • 66
-
A Simple Framework to Accelerate Multilingual Language Model for Monolingual Text Generation
Paper • 2401.10660 • Published • 2 -
PersianMind: A Cross-Lingual Persian-English Large Language Model
Paper • 2401.06466 • Published • 5 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 55 -
MaLA-500: Massive Language Adaptation of Large Language Models
Paper • 2401.13303 • Published • 12
-
The Impact of Reasoning Step Length on Large Language Models
Paper • 2401.04925 • Published • 18 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 55 -
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Paper • 2401.05033 • Published • 18 -
Towards Conversational Diagnostic AI
Paper • 2401.05654 • Published • 20
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper • 2401.02038 • Published • 65 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 188 -
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper • 2401.01055 • Published • 55 -
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Paper • 2401.01325 • Published • 27