I think this works amazing, it makes so detailed image captioning. I tried with a few pics, and gp

I think this works amazing, it makes so detailed image captioning. I tried with a few pics, and gpt4 vs Llma3.. I think Llma3 is trained way better than gpt. in every picture of me, llama gave better prompting.

gpt4o
The image shows an individual standing in a parking lot. The person is wearing a black t-shirt and grey pants, and has a watch on the left wrist. The face is obscured by a brown rectangle, likely for privacy reasons. In the background, there are parked cars and trees, suggesting this might be near a park or a commercial area.

vs
Lma3

The image shows a man standing in a parking lot. He is wearing a black t-shirt and olive green cargo pants. He has a short haircut and a beard. He is looking directly at the camera with a neutral expression. In the background, there are several cars parked in the lot, and there is a structure that appears to be a part of a building or a canopy. The sky is overcast, suggesting it might be a cloudy day.
Was this page helpful?