Hello everyone,
I’m encountering an issue while using the Google Cloud console (the Chat section of Vertex AI Studio) to compare two documents written in different languages but with the same content. I expect the system to identify all differences or mismatches between the documents, but it returns only 2–3 mismatches in a single response, even though there are actually more than 5.
Here are the details of my situation:
Is there a way to ensure the system provides a comprehensive list of all mismatches in one response? Am I potentially missing a configuration or parameter setting that affects the output?
Thank you for your help!
Hi @PhuuPwint,
Welcome to Google Cloud Community!
I understand that you're trying to compare two documents in different languages using Vertex AI Studio's Chat section, specifically with Gemini 1.5 Pro-002, but you're only seeing a partial list of mismatches.
Here's why your current approach is unlikely to succeed:
You may try these approaches:
1. Check the model parameters. You haven't mentioned your settings, but the temperature parameter controls the randomness of responses, and adjusting it can affect the completeness of the output. A lower temperature (closer to 0) tends to give more consistent results, so try lowering it if the list of mismatches varies widely between attempts.
2. Review your prompt design. Make sure it clearly instructs the model to list all mismatches. You might want to experiment with different phrasings to see if they yield better results.
3. If the documents are long, consider breaking them into smaller sections and processing them in batches. This might help the model handle the comparison more effectively. You may refer to this documentation for reference.
4. Verify that the model you're using supports both languages you're working with. Some models support certain languages better than others. You've also specified Gemini 1.5 Pro-002; newer models will likely be better at this task, so use the most up-to-date model version if possible.
5. Use the feedback mechanism in Vertex AI to report this issue. Google's team might provide insights or updates to improve the functionality.
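As a rough illustration of suggestions 2 and 3, here is a minimal Python sketch that splits both documents into aligned sections, asks for every mismatch in each small batch, and merges the answers. Everything in it is an assumption made for illustration: the names `PROMPT_TEMPLATE`, `pair_sections`, `compare_documents`, and `ask_model` are hypothetical, and `ask_model` is only a placeholder for whatever function you use to send a prompt to the model (ideally with a low temperature) and get text back. It is not part of the Vertex AI API.

```python
from typing import Callable, List, Tuple

# Hypothetical prompt wording; adjust the phrasing to your own documents.
PROMPT_TEMPLATE = (
    "Compare the two passages below. They should carry the same content "
    "in different languages.\n"
    "List EVERY mismatch as a numbered list, one per line. "
    "If there are none, reply with exactly: NONE\n\n"
    "Passage A:\n{a}\n\nPassage B:\n{b}\n"
)


def pair_sections(
    doc_a: List[str], doc_b: List[str], batch_size: int = 2
) -> List[Tuple[str, str]]:
    """Group aligned sections of both documents into small batches."""
    if len(doc_a) != len(doc_b):
        raise ValueError("both documents need the same number of sections")
    batches = []
    for start in range(0, len(doc_a), batch_size):
        a = "\n".join(doc_a[start:start + batch_size])
        b = "\n".join(doc_b[start:start + batch_size])
        batches.append((a, b))
    return batches


def compare_documents(
    doc_a: List[str],
    doc_b: List[str],
    ask_model: Callable[[str], str],  # placeholder for your own model call
    batch_size: int = 2,
) -> List[str]:
    """Ask for mismatches batch by batch and collect every reported line."""
    mismatches: List[str] = []
    for a, b in pair_sections(doc_a, doc_b, batch_size):
        reply = ask_model(PROMPT_TEMPLATE.format(a=a, b=b)).strip()
        if reply == "NONE":
            continue
        # Keep each non-empty line of the reply as one reported mismatch.
        mismatches.extend(
            line.strip() for line in reply.splitlines() if line.strip()
        )
    return mismatches
```

Because each request covers only a few sections, each response has fewer mismatches to report, which may make it less likely that some are dropped. The sketch assumes the two documents can be split into the same number of corresponding sections (for example, page by page).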
Here is some documentation that you may find useful:
Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.
Dear @ruthseki,
Thank you for providing an explanation regarding my issue.
To summarize, can I confirm that the issue is likely related to the context window and token limits, even though the model's context window is 2M tokens and its output limit is 8192 tokens?
Based on your suggestions, here are the approaches I have already tried:
One more detail: the documents I tested are PDFs of up to 9 pages each.
I appreciate your assistance so far. Could you kindly provide further guidance regarding my concerns outlined above?
Thank you once again for your help.
Phuu Pwint