PUBLICATIONS

Simultaneous convolutional neural network for highly efficient image steganography

Pham Van Toan - Hoang Dinh Thoi - Do Hoang Thai Duong - Ta Minh Thanh 2019 19th International Symposium on Communications and Information Technologies (ISCIT) http://iscit2019.org

Technical: Information Security, Image Steganography, Secure Data Transmission, Deep Convolutional Neural Network

This paper addresses image steganography with deep learning models. The task is to hide one image (the secret image) inside another image of the same size (the cover image). Our experiments indicate that the method performs well, and a comparison with results from Google Research and Shanghai University shows advantages over those similar approaches.

Deep Neural Networks based Invisible Steganography for Audio-into-Image Algorithm

Pham Huu Quang - Hoang Dinh Thoi - Pham Van Toan - Ta Minh Thanh 2019 IEEE 8th Global Conference on Consumer Electronics (GCCE 2019) http://www.ieee-gcce.org/2019

Technical: Information Security, Steganography, Secure Data Transmission, Deep Convolutional Neural Network.

Steganography is the science of concealing secret information inside ordinary forms of data. This paper proposes using deep learning techniques to hide secret audio in digital images. Extensive experiments are carried out with a set of 24K images and an audio dataset named VIVOS Corpus. The experimental results confirm that our method is more effective than traditional approaches: the integrity of both image and audio is well preserved, while the length of the audio that can be hidden increases significantly.

Proposal of feature matching technique using similarity features filtering for image alignment

Pham Van Toan, Ta Minh Thanh, Nguyen Thanh Trung, Pham Thi Hong Anh Proceedings of the ISSAT International Conference on Data Science in Business, Finance and Industry (DSBFI 2019) https://www.researchgate.net/publication/332696653_Proposal_of_Feature_Matching_Technique_Using_Similarity_Features_Filtering_for_Image_Alignment

Technical: Image alignment, similarity features filtering, feature matching, feature extraction.

In this paper, we propose a new feature matching approach called similarity features filtering, together with several pre-processing techniques for invoice images, to improve image alignment accuracy. The experimental results show that the proposed approach achieves better results than other feature-based methods.

Improving Phonetic Recognition with Sequence-length Standardized MFCC Features and Deep Bi-directional LSTM

Pham Van Toan, Nguyen Thanh Hau and Ta Minh Thanh 2018 5th NAFOSTED Conference on Information and Computer Science (NICS) https://www.researchgate.net/publication/329705993_Improving_Phonetic_Recognition_with_Sequence-length_Standardized_MFCC_Features_and_Deep_Bi-Directional_LSTM

Technical: Natural language processing, audio processing with MFCC, sequence length, recurrent neural network with TensorFlow.

The paper proposes a novel deep learning approach to phonetic recognition. Specifically, we combine Mel Frequency Cepstral Coefficients (MFCC) with sequence-length standardization to represent the acoustic features of speech, and we use different RNN architectures for phonetic classification. The well-known TIMIT dataset is used in both the training and evaluation phases. Our lowest error rate, 13.05% PER, is obtained with a bidirectional LSTM; at the time of writing this was the best result on TIMIT, a reduction of about 3.5% compared to the previous best result.
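As a reading aid for the PER figure above: phone error rate is conventionally the edit distance between the reference and predicted phone sequences, divided by the reference length. The sketch below is a generic illustration of that metric, not the paper's evaluation code; the function names and the toy phone sequences are assumptions made for the example.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two label sequences."""
    prev = list(range(len(hyp) + 1))  # distance from empty ref prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to empty hyp
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference phone
                curr[j - 1] + 1,     # insertion of a hypothesis phone
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def phone_error_rate(ref, hyp):
    """PER = (substitutions + deletions + insertions) / len(ref)."""
    if not ref:
        raise ValueError("reference sequence must not be empty")
    return edit_distance(ref, hyp) / len(ref)


# Toy sequences (hypothetical, not taken from TIMIT).
reference = ["sil", "k", "ae", "t", "sil"]
hypothesis = ["sil", "k", "eh", "t"]
per = phone_error_rate(reference, hypothesis)
print(f"PER = {per:.2%}")
```

Here one substitution and one deletion against five reference phones give a PER of 40%.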

Large scale fashion search system with deep learning and quantization indexing

Pham Van Toan, Hoang Dinh Thoi, Pham Hoang Anh, Nguyen Thanh Hau, Ta Minh Thanh Proceedings of the Ninth International Symposium on Information and Communication Technology. ACM, 2018. https://dl.acm.org/citation.cfm?id=3287964

Technical: Object detection with SSD MobileNetV2, Triplet loss, Quantization indexing, Similarity learning, Image retrieval.

In this paper, we propose a fashion search system that automatically recognizes clothes and suggests multiple similar clothing items with very low latency. Extensive experiments verify that our system outperforms all existing systems in terms of clothing item retrieval time.
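The keyword list for this entry mentions triplet loss for similarity learning. As a minimal sketch of the standard formulation (not the paper's training code), the loss pushes an anchor embedding closer to a similar item than to a dissimilar one by at least a margin. The margin value, the use of plain Euclidean distance, and the two-dimensional toy embeddings are assumptions; some implementations use squared distances instead.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin=0.2):
    """max(0, d(anchor, positive) - d(anchor, negative) + margin)."""
    return max(
        0.0,
        euclidean(anchor, positive) - euclidean(anchor, negative) + margin,
    )


# Toy 2-D embeddings (hypothetical values for illustration only).
anchor = [0.0, 0.0]
positive = [0.1, 0.0]       # similar item, close to the anchor
easy_negative = [1.0, 0.0]  # dissimilar item, already far away
hard_negative = [0.2, 0.0]  # dissimilar item, still too close

print(triplet_loss(anchor, positive, easy_negative))  # margin satisfied
print(triplet_loss(anchor, positive, hard_negative))  # margin violated
```

The easy negative already satisfies the margin and contributes zero loss, while the hard negative yields a positive loss that training would try to reduce.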

A Practical Solution to the ACM RecSys Challenge 2018

Pham Thi Hong Anh ACM RecSys challenge 2018 https://www.researchgate.net/publication/330304128_A_Practical_Solution_to_the_ACM_RecSys_Challenge_2018

Technical: Recommendation with Collaborative Filtering and SVD, Matrix Factorization, Content-based learning.

In the ACM RecSys Challenge 2018, the goal is to build a recommendation system that automatically recommends multiple suitable songs for users. Using the dataset provided by Spotify, we employed different algorithms and techniques and achieved a top-15 result.

Deep learning ASR-based approach to non-native learner mispronunciation detection

Pham Van Toan - Ta Minh Thanh - Nguyen Thanh Hau The 2018 Vietnam joint Conference on Artificial Intelligence for Life (AI4Life-2018) https://ai4life.uet.vnu.edu.vn

Technical: Speech Recognition, Mispronunciation Evaluation, Goodness of Pronunciation Estimation.

In this paper, we evaluate deep learning models such as CNNs, RNNs, and combinations of the two for Japanese phonetic classification. The study was applied in Talky Bird, a mobile application that detects Japanese learners' pronunciation errors.

Aggregation of non linear features LASSO in real estate pricing

Pham Van Toan, Nguyen Hoang Huy Vietnam Mathematics and Applications 2016 http://www1.vnua.edu.vn/tapchi/Upload/9-2016-cntt.pdf

Technical: Lasso Regression, Combine Features, Feature Extraction for Real Estate data.

In this paper, we propose a new technique to predict real estate prices in Long Bien district, Viet Nam, and Montreal district, Canada. The experimental results verify that the proposed method produces better real estate price predictions than both the traditional linear regression algorithm and the support vector machine (SVM).

Vietnamese Text Classification based on BoW and Keywords Extraction with Neural Network

Pham Van Toan, Ta Minh Thanh The 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems Conference 2017 https://ieeexplore.ieee.org/document/8233559

Technical: Bag of Words, Keywords Extraction, Neural Network, Text Classification.

Text classification has become one of the main applications in the field of natural language processing. Many approaches have been proposed to address this problem; however, most of them have only been applied to English documents. In this paper, we employ Bag of Words (BoW), a keyword extraction technique, and a neural network to classify Vietnamese news. According to the experimental evaluation, the accuracy is reported to be 99.75%.
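For readers unfamiliar with the Bag of Words representation named above, the sketch below shows the general idea: each document becomes a vector of token counts over a fixed vocabulary, which can then be fed to a classifier. It illustrates the representation only and is not the paper's pipeline. The whitespace tokenizer is a simplifying assumption (Vietnamese text normally needs word segmentation first), and the function names and toy headlines, written without diacritics, are hypothetical.

```python
from collections import Counter


def build_vocabulary(documents):
    """Map each distinct lowercase token to a column index."""
    tokens = sorted({tok for doc in documents for tok in doc.lower().split()})
    return {tok: idx for idx, tok in enumerate(tokens)}


def bow_vector(document, vocab):
    """Count how often each vocabulary token occurs in the document."""
    counts = Counter(document.lower().split())
    # Tokens outside the vocabulary are ignored.
    return [counts.get(tok, 0) for tok in vocab]


# Toy corpus (hypothetical headlines for illustration).
corpus = ["bong da viet nam", "kinh te viet nam"]
vocab = build_vocabulary(corpus)
vector = bow_vector("viet nam viet", vocab)
print(vocab)
print(vector)
```

The resulting vectors have one entry per vocabulary token, so every document maps to an input of the same length regardless of how many words it contains.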