47 lines
1.8 KiB
Markdown
47 lines
1.8 KiB
Markdown
<!--Copyright 2022 The HuggingFace Team and The OpenBMB Team. All rights reserved.
|
|
|
|
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with
|
|
the License. You may obtain a copy of the License at
|
|
|
|
http://www.apache.org/licenses/LICENSE-2.0
|
|
|
|
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on
|
|
an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the
|
|
specific language governing permissions and limitations under the License.
|
|
|
|
⚠️ Note that this file is in Markdown but contain specific syntax for our doc-builder (similar to MDX) that may not be
|
|
rendered properly in your Markdown viewer.
|
|
|
|
-->
|
|
|
|
# CPMAnt
|
|
|
|
## Overview
|
|
|
|
CPM-Ant is an open-source Chinese pre-trained language model (PLM) with 10B parameters. It is also the first milestone of the live training process of CPM-Live. The training process is cost-effective and environment-friendly. CPM-Ant also achieves promising results with delta tuning on the CUGE benchmark. Besides the full model, we also provide various compressed versions to meet the requirements of different hardware configurations. [See more](https://github.com/OpenBMB/CPM-Live/tree/cpm-ant/cpm-live)
|
|
|
|
This model was contributed by [OpenBMB](https://huggingface.co/openbmb). The original code can be found [here](https://github.com/OpenBMB/CPM-Live/tree/cpm-ant/cpm-live).
|
|
|
|
## Resources
|
|
|
|
- A tutorial on [CPM-Live](https://github.com/OpenBMB/CPM-Live/tree/cpm-ant/cpm-live).
|
|
|
|
## CpmAntConfig
|
|
|
|
[[autodoc]] CpmAntConfig
|
|
- all
|
|
|
|
## CpmAntTokenizer
|
|
|
|
[[autodoc]] CpmAntTokenizer
|
|
- all
|
|
|
|
## CpmAntModel
|
|
|
|
[[autodoc]] CpmAntModel
|
|
- all
|
|
|
|
## CpmAntForCausalLM
|
|
|
|
[[autodoc]] CpmAntForCausalLM
|
|
- all |