Falcon
Overview
Falcon is a family of causal decoder-only models built by TII. The largest Falcon checkpoints have been trained on at least 1T tokens of text, with a particular emphasis on the RefinedWeb corpus. They are made available under the Apache 2.0 license.
Falcon's architecture is modern and optimized for inference, with multi-query attention and support for efficient
attention variants like FlashAttention. Both 'base' models trained only as causal language models and
'instruct' models that have received further fine-tuning are available.
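To make the inference benefit of multi-query attention concrete, here is a rough back-of-the-envelope sketch of key/value cache size during generation. The `kv_cache_bytes` helper and the model shape (32 layers, 64 query heads, head dimension 64, 2048 tokens, 2-byte values) are illustrative assumptions, not the configuration of any particular Falcon checkpoint.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len,
                   batch_size=1, bytes_per_value=2):
    """Bytes needed to cache keys and values for every layer and position."""
    # Factor of 2: one tensor for keys and one for values in each layer.
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * batch_size * bytes_per_value)


# Hypothetical model shape, chosen only for illustration.
NUM_LAYERS, NUM_QUERY_HEADS, HEAD_DIM, SEQ_LEN = 32, 64, 64, 2048

# Multi-head attention: every query head has its own key/value head.
mha_bytes = kv_cache_bytes(NUM_LAYERS, NUM_QUERY_HEADS, HEAD_DIM, SEQ_LEN)

# Multi-query attention: all query heads share a single key/value head.
mqa_bytes = kv_cache_bytes(NUM_LAYERS, 1, HEAD_DIM, SEQ_LEN)

print(f"multi-head:  {mha_bytes / 2**20:.0f} MiB")   # 1024 MiB
print(f"multi-query: {mqa_bytes / 2**20:.0f} MiB")   # 16 MiB
```

Under these assumptions the cache shrinks by a factor equal to the number of query heads, which is the main reason multi-query attention helps with long generations and larger batches.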
Falcon models are (as of 2023) some of the largest and most powerful open-source language models, and consistently rank highly on the OpenLLM leaderboard.
Converting custom checkpoints
Falcon models were initially added to the Hugging Face Hub as custom code checkpoints. However, Falcon is now fully
supported in the Transformers library. If you fine-tuned a model from a custom code checkpoint, we recommend converting
your checkpoint to the new in-library format. This should significantly improve stability and performance,
especially for generation, and it removes the need to pass trust_remote_code=True.
You can convert custom code checkpoints to full Transformers checkpoints using the convert_custom_code_checkpoint.py
script located in the Falcon model directory of the Transformers library. To use this script, simply call it with

python convert_custom_code_checkpoint.py --checkpoint_dir my_model

This will convert your checkpoint in-place, and you can immediately load it from the directory afterwards with
e.g. from_pretrained(). If your model hasn't been uploaded to the Hub, we recommend making a backup before
attempting the conversion, just in case!
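Because the conversion rewrites the checkpoint in place, it can help to script the backup and the conversion together. The following is a minimal sketch, not part of the Transformers library: the helper names `backup_checkpoint`, `build_convert_command`, and `backup_and_convert` are hypothetical, the `.bak` suffix is an arbitrary choice, and the script path is assumed to point at your local copy of convert_custom_code_checkpoint.py.

```python
import shutil
import subprocess
import sys
from pathlib import Path


def backup_checkpoint(checkpoint_dir):
    """Copy the checkpoint directory to a sibling '<name>.bak' directory."""
    src = Path(checkpoint_dir)
    dst = src.with_name(src.name + ".bak")
    # copytree raises FileExistsError if the backup directory already exists,
    # so an earlier backup is never silently overwritten.
    shutil.copytree(src, dst)
    return dst


def build_convert_command(checkpoint_dir,
                          script="convert_custom_code_checkpoint.py"):
    """Build the command line described in the text above."""
    return [sys.executable, str(script), "--checkpoint_dir", str(checkpoint_dir)]


def backup_and_convert(checkpoint_dir,
                       script="convert_custom_code_checkpoint.py"):
    """Back up the checkpoint, then run the in-place conversion script."""
    backup = backup_checkpoint(checkpoint_dir)
    # check=True raises CalledProcessError if the script exits with an error.
    subprocess.run(build_convert_command(checkpoint_dir, script), check=True)
    return backup
```

Calling `backup_and_convert("my_model")` would leave the original files in `my_model.bak` and the converted checkpoint in `my_model`, which can then be loaded with from_pretrained() without trust_remote_code=True.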
FalconConfig
autodoc FalconConfig - all
FalconModel
autodoc FalconModel - forward
FalconForCausalLM
autodoc FalconForCausalLM - forward
FalconForSequenceClassification
autodoc FalconForSequenceClassification - forward
FalconForTokenClassification
autodoc FalconForTokenClassification - forward
FalconForQuestionAnswering
autodoc FalconForQuestionAnswering - forward