I have spent a lot of time working on statistical machine learning and natural language processing within the field of artificial intelligence.
I have experience in every stage of the process: data collection (crawling), data generation, data preprocessing, model training, hyperparameter tuning, model quantization, model merging, and model deployment. I also keep up with the rapidly evolving field of AI.
After work, I spend my free time on Kaggle competitions.
- Data collection (crawling) and generation (GPT, Anthropic, Gemini)
- Data preprocessing
- Model training (Axolotl, DeepSpeed, FlashAttention, Liger Kernel) and hyperparameter tuning (wandb)
- Model quantization
- Model merging
- Model deployment (vLLM, GGUF)
Python