Pham Van Toan - Hoang Dinh Thoi - Do Hoang Thai Duong - Ta Minh Thanh 2019 19th International Symposium on Communications and Information Technologies (ISCIT)
Techinical: Information Security, Image Steganography, Secure Data Transmission, Deep Convolutional Neural Network
In this paper, our work focuses on solving image steganography with Deep Learning models. The main job is to hide an image (secret image) inside another image of the same size (cover photo). Through our tests, we have proven that this method offers optimum performance. The results compared with the research of Google Research and Shanghai University show that our method has superior advantages over similar research.
Pham Huu Quang - Hoang Dinh Thoi - Pham Van Toan - Ta Minh Thanh 2019 IEEE 8th Global Conference on Consumer Electronics (GCCE 2019)
Techinical: Information Security, Steganography, Secure Data Transmission, Deep Convolutional Neural Network.
Steganography is the science of concealing secret information inside usual forms of data. In this paper, the use of deep learning techniques to hide secret audio into the digital images is proposed. Extensive experiments are carried out with a set of 24K images and an audio dataset named VIVOS Corpus. Through experimental results, it has been confirmed that our method is more effective than traditional approaches. The integrity of both image and audio is well preserved while the length of the hidden audio is significantly improved.
Pham Van Toan, Ta Minh Thanh, Nguyen Thanh Trung, Pham Thi Hong Anh Proceedings of the ISSAT International Conference on Data Science in Business, Finance and Industry (DSBFI 2019)
Techinical: Image alignment, similarity features filtering, feature matching, feature-extraction.
In this paper, we propose a new approach for feature matching method called similarity features filtering and some techniques applying on invoices image pre-processing to improve the image alignment accuracy. The experimental results show that our proposed approach can achieve better results than other feature-based methods.
Pham Van Toan, Nguyen Thanh Hau and Ta Minh Thanh 2018 5th NAFOSTED Conference on Information and Computer Science (NICS)
Techinical: Natural language processing, audio processing with MFCC, sequence length, recurrent neural network with tensorflow.
The paper proposes a novel approach using deep learning to address the problem of phonetic recognition. Specifically, we combine the Mel Frequency Cepstral Coefficients (MFCC) method with sequence-length to present the acoustic features of speech and use different RNN architectures to phonetic classification. Besides, the well-known TIMIT dataset is used in both the training phase and evaluation phase. Currently, we have achieved the lowest error rate (13.05% PER) by using Bidirectional LSTM, which is the best result in TIMIT dataset with the reduction of about 3.5% compared to the last best result.
Pham Van Toan, Hoang Dinh Thoi, Pham Hoang Anh, Nguyen Thanh Hau, Ta Minh Thanh Proceedings of the Ninth International Symposium on Information and Communication Technology. ACM, 2018.
Techinical: Object detection with SSD MobilenetV2, Triplet loss,Quantization indexing, Similarity learning, image retrieval.
In the paper, we propose a fashion search system, which automatically recognizes clothes and suggests multiple similar clothing items with an impressively low latency. Through extensive experiments, it is verified that our system outperforms all existing systems in term of clothing item retrieval time.
Pham Thi Hong Anh ACM RecSys challenge 2018
Techinical: Recommendation with Colaborative Filtering and SVD, Matrix Factorization, Content based learning.
In the ACM RecSys challenge 2018, the goal is to build a recommendation system which can automatically recommend multiple suitable songs for users. With the provided dataset by Spotify, we have employed different algorithms and techniques and achieved the top 15 best result.
Pham Van Toan - Ta Minh Thanh - Nguyen Thanh Hau The 2018 Vietnam joint Conference on Artificial Intelligence for Life (AI4Life-2018)
Techinical: Speech Recognition, Mispronunciation Evaluation, Goodness of Pronunciation Estimation.
In these paper, we tried some models Deep learning like CNN, RNN and combining them for phonetic classification Japanese. The study was applied in Talky Bird - a mobile application that detects Japanese learners' pronunciation errors.
Pham Van Toan, Nguyen Hoang Huy Vietnam Mathematics and Applications 2016
Techinical: Lasso Regression, Combine Features, Feature Extraction for Real Estate data.
In this paper we propose a new technique to predict the real estate pricing in Long Bien districs, Viet Nam and Montreal district, Canada. The experimental result has verified that our proposed method can generate better real estate pricing prediction than both traditional linear regression algorithm and support vector machine (SVM).
Pham Van Toan, Ta Minh Thanh The 21st Asia Pacific Symposium on Intelligent and Evolutionary Systems Conference 2017
Techinical: Bag of Word, Keywords Extraction, Neural Network, Text Classification.
Text classification has become one of the main applications in the field of natural language processing. There have been many proposed approaches to address this problem; however, most of them only applied to English documents. In this paper, we employ Bag of Words (BoW), keywords extraction technique, and Neural Network approach to classify Vietnamese news. According to the experimental evaluation, the accuracy is reported to be 99.75%.