We do not disclose the datasets used to train our models.
Some assets remain proprietary, including:
the training datasets
the training logic and resources required to produce both open-source and optimized models
π Keeping these elements private helps us protect our intellectual property and maintain model quality.