Tongyi Qianwen Qwen 1.5 is a large-scale language model series introduced by Alibaba Group, significantly updated and optimized in several aspects.
1. Overview of the model
- Parameter scale: The Qwen1.5 series spans models from 0.5B to 72B parameters, and also includes the Qwen1.5-110B model with more than 100 billion parameters, meeting different computational needs.
- Model type: Open-source models are released in multiple versions, including Base and Chat variants, giving developers worldwide convenient access.
2. Core characteristics
- Multilingual enhancement: Qwen 1.5 has been significantly optimized for multilingual processing, supporting a wider range of languages and more complex language scenarios. For example, it supports English, Chinese, French, Spanish, and other languages, and performs well on public benchmark evaluations covering four dimensions: subject knowledge tests, semantic comprehension, translation tasks, and math problem solving.
- Human preference alignment: Alignment of the model with human preferences is enhanced by employing techniques such as Direct Preference Optimization (DPO) and Proximal Policy Optimization (PPO).
- Long Sequence Support: All scales of the Qwen1.5 model support context lengths of up to 32,768 tokens, dramatically improving the ability to process long text.
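The DPO technique mentioned above optimizes the model directly on pairs of preferred and rejected responses. A minimal numeric sketch of the DPO loss for one preference pair, illustrative only and not Alibaba's actual training code, assuming you already have per-sequence log-probabilities from the policy and a frozen reference model:

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) response pair.

    Each argument is the total log-probability a model assigns to the
    full response; beta controls how far the policy may drift from the
    reference model.
    """
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_ratio - rejected_ratio)
    # -log(sigmoid(x)) written in the numerically stable form log(1 + exp(-x))
    return math.log1p(math.exp(-logits))
```

When the policy matches the reference exactly, the loss sits at log 2; it decreases as the policy assigns relatively more probability to the chosen response than the rejected one.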
3. Performance evaluation
- Basic capability assessment: Qwen1.5 showed significant improvement on several datasets, including MMLU (5-shot), C-Eval, HumanEval, GSM8K, and BBH. The 72B version in particular outperforms Llama2-70B across all of these tests.
- Verification of multilingual capabilities: Through a comprehensive evaluation of 12 major languages from Europe, East Asia and Southeast Asia, Qwen 1.5 demonstrates strong adaptability in a global multilingual environment.
- Human Preference Alignment Test: On benchmarks such as MT-Bench and Alpaca-Eval, Qwen 1.5 demonstrates response quality that is highly consistent with human preferences.
4. Developer experience
- Model ease of use: Alibaba has officially merged the Qwen 1.5 code into the Hugging Face transformers codebase, greatly simplifying use of the model. Developers can now load it natively with transformers>=4.37.0, without specifying additional options for development and deployment.
- Partners and frameworks: Qwen 1.5 is also supported by several well-known third-party frameworks, such as vLLM, SGLang, AutoAWQ, and AutoGPTQ, ensuring broad accessibility and ease of use for serving and quantization.
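The simplified loading path described above can be sketched with plain transformers calls. In the snippet below, the model id Qwen/Qwen1.5-0.5B-Chat and the build_chatml_prompt helper are illustrative assumptions: the helper hand-rolls the ChatML layout that Qwen chat models use, which in practice the tokenizer's apply_chat_template method generates for you.

```python
def build_chatml_prompt(messages):
    """Render a message list in the ChatML format used by Qwen chat models.

    messages: list of {"role": ..., "content": ...} dicts. In real code,
    tokenizer.apply_chat_template(messages, tokenize=False,
    add_generation_prompt=True) produces an equivalent string.
    """
    parts = [f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n"
             for m in messages]
    parts.append("<|im_start|>assistant\n")  # cue the model to respond
    return "".join(parts)


# With transformers >= 4.37.0 the model loads with native code
# (requires network access to download the weights):
# from transformers import AutoModelForCausalLM, AutoTokenizer
# tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen1.5-0.5B-Chat")
# model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen1.5-0.5B-Chat")
```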
5. Deployment and applications
- PAI-QuickStart support: With the PAI-QuickStart component of the Alibaba Cloud AI platform PAI, users can easily fine-tune and quickly deploy Qwen 1.5 series models.
- Application scenarios: Qwen1.5's strong performance and multilingual support make it suitable for a variety of applications, such as intelligent customer service, text generation, and knowledge Q&A.
Tongyi Qianwen Qwen 1.5 is a powerful and easy-to-use large-scale language model that shows clear advantages in multilingual processing, long-text support, and basic capabilities, bringing new breakthroughs to the field of artificial intelligence.