AEAO: Auto Encoder with Adam Optimizer Method for Efficient Document Indexing Of Big Data
Y. Krishna Bhargavi1, Y.S.S.R. Murthy2, O. Srinivasa Rao3

1Y.Krishna Bhargavi*, Department of CSE, Gokaraju Rangaraju Institute of Engineering and Technology, Hyderabad, India.
2Dr.Y.S.S.R.Murthy, Department of IT, SRKR Engineering College, Bhimavaram, India.
3Dr.O.Srinivasa Rao, Department of CSE, University College of Engineering, JNTUK, Kakinada, India. 

Manuscript received on 03 August 2019. | Revised Manuscript received on 09 August 2019. | Manuscript published on 30 September 2019. | PP: 3933-3942 | Volume-8 Issue-3 September 2019 | Retrieval Number: C5141098319/2019©BEIESP | DOI: 10.35940/ijrte.C5141.098319
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: In the big data era, document classification has become an active research area because of the explosive growth in data volume. Document indexing is one of the important tasks in text classification. The objective of this research is to improve document indexing performance by using the Adam optimizer in an auto-encoder. High dimensionality and the multi-class nature of the problem reduce the accuracy of document indexing. This paper proposes an enhanced auto-encoder based on the objective function of Adam optimization (AEAO), which improves the learning rate and the accuracy of indexing. Documents from the 20-newsgroup data set are converted into a vector representation, and cosine similarity and Pearson correlation are then measured on the vectors. The word-to-vector representation holds words in vector form, and a word's value increases with its frequency in the document. The Adam optimization technique selects features using the similarity values and improves the learning rate. The auto-encoder classifier then classifies each document based on the objective function of the Adam optimizer. The experiments were conducted in Python, and the results indicate that AEAO achieves better classification performance than the Similarity-based classification framework for Multiple-Instance Learning and the Self-Adaptive LSH encoding for Multi-Instance Learning techniques in terms of precision, recall and F-score.
Keywords: Adam Optimizer, Auto Encoder, Big Data, Cosine Similarity, Document Indexing, Pearson Correlation, Word to Vector Representation.
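
The two similarity measures named in the abstract, cosine similarity and Pearson correlation, can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes plain term-frequency vectors built from whitespace-split tokens in place of the paper's word-to-vector representation, and the function names (term_frequency_vectors, cosine_similarity, pearson_correlation) are hypothetical, chosen only for this example.

```python
import math
from collections import Counter


def term_frequency_vectors(doc_a, doc_b):
    """Build aligned term-frequency vectors over the joint vocabulary.

    Assumption: whitespace tokenization and raw counts stand in for the
    paper's word-to-vector representation.
    """
    counts_a = Counter(doc_a.lower().split())
    counts_b = Counter(doc_b.lower().split())
    vocab = sorted(set(counts_a) | set(counts_b))
    return [counts_a[w] for w in vocab], [counts_b[w] for w in vocab]


def cosine_similarity(u, v):
    """Cosine of the angle between u and v (0.0 if either has zero norm)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def pearson_correlation(u, v):
    """Pearson correlation, computed as the cosine of the mean-centered vectors."""
    if not u or len(u) != len(v):
        raise ValueError("vectors must be non-empty and of equal length")
    mean_u = sum(u) / len(u)
    mean_v = sum(v) / len(v)
    centered_u = [a - mean_u for a in u]
    centered_v = [b - mean_v for b in v]
    return cosine_similarity(centered_u, centered_v)


if __name__ == "__main__":
    vec_a, vec_b = term_frequency_vectors(
        "big data document indexing", "document indexing with auto encoder"
    )
    print(cosine_similarity(vec_a, vec_b))
    print(pearson_correlation(vec_a, vec_b))
```

Both measures are bounded: cosine similarity lies in [0, 1] for non-negative count vectors, while Pearson correlation lies in [-1, 1] because the vectors are centered before comparison.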

Scope of the Article:
Big Data Security