Technical specifications and details for all CosmicFish models.
Balanced model for everyday AI tasks.
Bigger model with excellent performance.
Advanced model for coding and reasoning.
Enhanced position awareness for better context understanding.
Optimized attention reducing computational requirements by 40%.
Advanced activation function improving model convergence.
Efficient normalization enhancing stability and reducing cost.
4-bit and 8-bit precision reducing model size by 75%.
Billions of tokens from web, research papers, and code datasets.