Introducing the PaliGemma tuning service: a vision-language model fine-tuned on your custom dataset
PaliGemma supports OCR, image captioning, visual question answering, and document understanding. For more information about the model's abilities, see https://huggingface.co/blog/paligemma
Potential application scenarios include OCR and receipt understanding.
To fine-tune the model, please provide a dataset with images and caption text.
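As a rough illustration of what an image-and-caption dataset could look like, here is a minimal sketch that checks a dataset before you send it. The layout is an assumption, not a requirement of this service: a JSONL file (one JSON object per line) with hypothetical "image" and "caption" keys, where "image" is a path relative to the folder containing the file. The function name load_manifest is also made up for this example.

```python
import json
from pathlib import Path

# Assumed layout (not specified by the service): one JSON object per line,
# with an "image" path relative to the manifest's folder and a "caption" string.
REQUIRED_KEYS = ("image", "caption")


def load_manifest(manifest_path, check_files=True):
    """Return (records, problems) for a JSONL manifest of image/caption pairs."""
    manifest_path = Path(manifest_path)
    root = manifest_path.parent
    records, problems = [], []
    with manifest_path.open(encoding="utf-8") as handle:
        for line_no, line in enumerate(handle, start=1):
            line = line.strip()
            if not line:
                continue  # blank lines are ignored
            try:
                record = json.loads(line)
            except json.JSONDecodeError as exc:
                problems.append(f"line {line_no}: invalid JSON ({exc.msg})")
                continue
            if not isinstance(record, dict):
                problems.append(f"line {line_no}: expected a JSON object")
                continue
            # A key counts as missing if it is absent, not a string, or blank.
            missing = [
                key for key in REQUIRED_KEYS
                if not isinstance(record.get(key), str) or not record[key].strip()
            ]
            if missing:
                problems.append(f"line {line_no}: missing or empty {missing}")
                continue
            if check_files and not (root / record["image"]).is_file():
                problems.append(f"line {line_no}: image not found: {record['image']}")
                continue
            records.append(
                {"image": record["image"], "caption": record["caption"].strip()}
            )
    return records, problems
```

For example, calling load_manifest("dataset/train.jsonl") returns the usable image/caption pairs together with a list of human-readable problems, so broken entries can be fixed before fine-tuning starts. If your data is in a different format, that is fine too; get in touch and we can agree on one.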
For any questions or ideas, feel free to contact me directly.
Expected deliverables include the fine-tuned model file, example Python code for running inference with the fine-tuned model, and a Containerfile.
Shop Location: Auckland, New Zealand