Not Created Well Topic From BERTopic for Nepali Dataset #2265
-
Generally, there is no need to preprocess the data. Whenever you use a different language than English, make sure you:
Also, make sure to read through the official docs. There are a lot of examples there that might help you out.
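As an illustration of a non-English setup (not necessarily the exact steps from the reply above), here is a minimal sketch that pairs a multilingual embedding model with a tokenizer suited to Devanagari script. The helper names `tokenize_devanagari` and `build_topic_model` are hypothetical. The sketch assumes that scikit-learn's default token pattern can split Nepali words at vowel signs, since those combining marks are, as far as I know, not matched by `\w`, so a custom tokenizer is passed to `CountVectorizer` instead.

```python
import re

# Devanagari block (U+0900-U+097F) minus the danda and double danda
# punctuation marks (U+0964, U+0965), so vowel signs stay attached to words.
_DEVANAGARI_WORD = re.compile(r"[\u0900-\u0963\u0966-\u097F]+")


def tokenize_devanagari(text):
    """Split text into Devanagari word tokens, dropping single-character tokens."""
    return [tok for tok in _DEVANAGARI_WORD.findall(text) if len(tok) > 1]


def build_topic_model(stop_words=None):
    """Assemble a BERTopic model for Nepali text (needs bertopic and scikit-learn)."""
    # Imported lazily so the tokenizer can be tried without these packages.
    from bertopic import BERTopic
    from sklearn.feature_extraction.text import CountVectorizer

    vectorizer_model = CountVectorizer(
        tokenizer=tokenize_devanagari,
        token_pattern=None,     # unused when a tokenizer is supplied
        lowercase=False,        # Devanagari has no letter case
        stop_words=stop_words,  # optional list of Nepali stop words (assumption)
    )
    return BERTopic(
        embedding_model="paraphrase-multilingual-MiniLM-L12-v2",
        vectorizer_model=vectorizer_model,
    )


print(tokenize_devanagari("नेपाल राम्रो देश हो।"))
```

The vectorizer only affects how topic words are extracted after clustering, so it can be adjusted without recomputing the embeddings.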
-
Thank you, @MaartenGr, for your quick response. According to the official documentation, paraphrase-multilingual-MiniLM-L12-v2 is the embedding model for multilingual text. I have used that same model for the Nepali text.
-
Thank you, @MaartenGr. Your advice has been really helpful for continuing my research, and I will do my best to follow your suggestions. I have another question. I have proposed research focusing solely on BERTopic, titled "Topic Modeling for Nepali Text Using BERTopic". My plan is to evaluate the model using metrics such as coherence score, topic diversity, and silhouette score. Is this a correct approach for evaluating the model, or are there other evaluation methods I should consider? I would greatly appreciate your guidance.
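Of the metrics listed, topic diversity is the simplest to compute by hand. A minimal sketch, assuming the common definition (the fraction of unique words among the top-k words of all topics); the function name `topic_diversity` and the toy topics are illustrative and not part of BERTopic:

```python
def topic_diversity(topics, top_k=10):
    """Fraction of unique words among the top_k words of every topic.

    `topics` is a list of word lists, each ordered from most to least
    representative. A value near 1.0 means the topics share few words;
    a value near 0.0 means they are largely redundant.
    """
    top_words = [word for topic in topics for word in topic[:top_k]]
    if not top_words:
        return 0.0
    return len(set(top_words)) / len(top_words)


# Toy example: two topics of three words each, sharing one word.
toy_topics = [["a", "b", "c"], ["a", "d", "e"]]
print(topic_diversity(toy_topics, top_k=3))
```

With a fitted model, the word lists could be built from `topic_model.get_topic(topic_id)`, which returns (word, score) pairs.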
-
@hari-chalise @MaartenGr I was able to figure out this issue and generate Nepali topics.
-
Oh great, which model did you use? And which university are you from?
On Sun, Jun 1, 2025 at 10:18 PM Saroj Dangol wrote:
> @hari-chalise @MaartenGr I am able to figure out this issue and generate Nepali topics.
> I am working on this as my Graduation Thesis.
> image.png: https://github.com/user-attachments/assets/830a0c10-01fb-44f5-ae1f-180e68eb5a4a
-
This is the plot of the derived topics, and this is the code.