<?xml version="1.0"?>
<records>
  <record>
    <language>eng</language>
    <publisher>Ansari Education and Research Society</publisher>
    <journalTitle>Journal of Ultra Scientist of Physical Sciences</journalTitle>
    <issn/>
    <eissn/>
    <publicationDate>September 2025</publicationDate>
    <volume>37</volume>
    <issue>4</issue>
    <startPage>29</startPage>
    <endPage>39</endPage>
    <doi>http://dx.doi.org/10.22147/jusps-B/370401</doi>
    <publisherRecordId>1547</publisherRecordId>
    <documentType>article</documentType>
    <title language="eng">Applications for AI and ML in the analysis of unstructured data across various sectors</title>
    <authors>
      <author>
        <name>FARHA KHAN</name>
        <affiliationId>1</affiliationId>
      </author>
      <author>
        <name>PRATIMA OJHA</name>
        <affiliationId>2</affiliationId>
      </author>
      <author>
        <name>GHIZAL F. ANSARI</name>
        <affiliationId>2</affiliationId>
      </author>
    </authors>
    <affiliationsList>
      <affiliationName affiliationId="1">Department of Mathematics, Madhyanchal Professional University, Bhopal-462001 (INDIA)</affiliationName>
      <affiliationName affiliationId="2">Department of Physics, Madhyanchal Professional University, Bhopal-462001 (INDIA)</affiliationName>
    </affiliationsList>
    <abstract language="eng">&lt;p&gt;The main focus of the topic is the process of transforming a collection of unstructured text documents into structured information based on mathematical and statistical principles. To begin, we&amp;rsquo;ll look at document models via the lens of the Bernoulli method, where the existence or absence of tokens the fundamental building elements of documents forms the foundation. Multinomial document model is the center of attention in an additional issue. It resembles the Bernoulli model in many ways, but instead of using the presence flag, it uses the frequentist approach, which considers how often the tokens appear in the text. To get latent topical structure across text sources and to fine-tune with the use of machine learning, we move onto researching unsupervised topic modeling strategies in the following challenge. Finally, using unstructured data analysis, we provide a model for predicting users&amp;rsquo;moods and actions on social media. A model that may capture user behavior and mood on social media is the Behavior Dirichlet Probability Model (BDPM).&lt;/p&gt;&#xD;
</abstract>
    <fullTextUrl format="html">https://ultraphysicalsciences.org/paper/1547/</fullTextUrl>
    <keywords>
      <keyword language="eng">Latent Semantic Indexing</keyword>
      <keyword language="eng">social media</keyword>
      <keyword language="eng">unstructured text</keyword>
      <keyword language="eng">Machine learning</keyword>
    </keywords>
  </record>
</records>
