Classify Amazon reviews into positive and negative using OpenNN

Generally, the feedback provided by a customer on a product can be categorized into positive and negative. Interpreting customer feedback through product reviews helps companies evaluate how satisfied the customers are with their products/services. This example aims to assess whether a review of an Amazon product could be positive or negative from its content.

Contents:

  1. Application type.
  2. Data set.
  3. Neural network.
  4. Training strategy.
  5. Testing analysis.
  6. Model deployment.
  7. Full Code.

1. Application type

The variable to be predicted can have two values (positive or negative). Therefore, this is a binary text classification project.

The goal here is to model the probability of a review being positive or negative, conditioned on the words used by the customer.

2. Data set

The first step is to prepare the data set, which is the source of information for the classification problem. The data set object is created by

DataSet data_set;

Next, we need to configure the following concepts:

  • Data source.
  • Separator.

The data source is the file amazon_cells_labelled.txt. It contains the data for this example in tab-separated text format and can be loaded as

data_set.set_data_file_name("path_to_source/amazon_cells_labelled.txt");

Our data contains 10000 rows with two columns separated by a tab character. This column separator is set by

data_set.set_text_separator(DataSet::Separator::Tab);
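For reference, each line of a labelled-sentences file of this kind holds a review followed by a tab-separated label (0 = negative, 1 = positive). The lines below are illustrative examples of the expected shape, not actual contents of the file:

```
Great phone, works perfectly.	1
The battery died within two days.	0
```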

Once we have set the data file name and the separator, we are ready to load our document. This can be done by

data_set.read_txt();

The reviews are divided into training, selection, and testing subsets. They represent 60% (6000), 20% (2000), and 20% (2000) of the original reviews, respectively, and are split at random using the following command

data_set.split_samples_random();

Now that the data is ready, we can query data set features such as the input words, the number of words, and the number of target variables. For this, we use the following commands

const Tensor<string, 1> input_words = data_set.get_input_columns_names();
const Tensor<string, 1> targets_names = data_set.get_target_variables_names();
const Index words_number = data_set.get_input_variables_number();
const Index target_variables_number = data_set.get_target_variables_number();

For more information about the data set methods, see DataSet class.

3. Neural network

The second step is to choose the correct neural network architecture. For text classification problems, it is usually composed of:

  • A scaling layer.
  • Two perceptron layers.
  • A probabilistic layer.
  • An unscaling layer.

The NeuralNetwork class is responsible for building the neural network and adequately organizing the layers of neurons using the following constructor. If you need more complex architectures, you should see NeuralNetwork class.

const Index hidden_neurons_number = 6;
NeuralNetwork neural_network(NeuralNetwork::ProjectType::TextClassification,
    {words_number, hidden_neurons_number, target_variables_number});

Once the neural network has been created, we can introduce information in the layers for a more precise calibration

neural_network.set_inputs_names(input_words);
neural_network.set_outputs_names(targets_names);

The model is now defined, so we proceed to the learning process with TrainingStrategy.

4. Training strategy

The third step is to set the training strategy, which is composed of:

  • Loss index.
  • Optimization algorithm.

Firstly, we construct the training strategy object

TrainingStrategy training_strategy(&neural_network, &data_set);

then, set the error term

training_strategy.set_loss_method(TrainingStrategy::LossMethod::CROSS_ENTROPY_ERROR);

and finally the optimization algorithm

training_strategy.set_optimization_method(TrainingStrategy::OptimizationMethod::ADAPTIVE_MOMENT_ESTIMATION);

We can now start the training process by using the command

training_strategy.perform_training();

For more information about the training strategy methods, see TrainingStrategy class.

5. Testing analysis

The fourth step is to evaluate our model. For that purpose, we need to use the testing analysis class, whose goal is to validate the model’s generalization performance. Here, we compare the neural network outputs to the corresponding targets in the testing instances of the data set.

We start by building the testing analysis object

TestingAnalysis testing_analysis(&neural_network, &data_set);

and perform the testing; in our case, we use binary classification tests

testing_analysis.print_binary_classification_tests();

For more information about the testing analysis methods, see TestingAnalysis class.

6. Model deployment

Once our model is complete, the neural network is ready to predict outputs for inputs it has never seen. This process is called model deployment.

To generate predictions with new data, you can use the calculate_outputs method, which takes a tensor of inputs:

neural_network.calculate_outputs(input_data);

For instance, the new input review is:

  • «Highly recommend for any one who has a bluetooth phone.»

and in OpenNN we can write it as

string review_1 = "Highly recommend for any one who has a bluetooth phone.";
Tensor<type,1> processed_review_1 = data_set.sentence_to_data(review_1);
string review_2 = "You have to hold the phone at a particular angle for the other party to hear you clearly.";
Tensor<type,1> processed_review_2 = data_set.sentence_to_data(review_2);

Once the reviews are transformed into numeric tensors, their predictions can be calculated using:

Tensor<type,2> input_data(2, words_number);
for(Index i = 0; i < words_number; i++)
{
  input_data(0,i) = processed_review_1(i);
  input_data(1,i) = processed_review_2(i);
}
Tensor<type,2> outputs = neural_network.calculate_outputs(input_data);

You can also save the model using:

neural_network.save_expression_c("../data/expression.txt");
neural_network.save_expression_python("../data/expression.py");

The model can then be implemented in other languages, such as Python or PHP.

7. Full Code

Joining all steps, we obtain the following code:

// DataSet
DataSet data_set;
data_set.set_data_file_name("path_to_source/amazon_cells_labelled.txt");
data_set.set_text_separator(DataSet::Separator::Tab);
data_set.read_txt();
data_set.split_samples_random();
const Tensor<string, 1> input_words = data_set.get_input_columns_names();
const Tensor<string, 1> targets_names = data_set.get_target_variables_names();
const Index words_number = data_set.get_input_variables_number();
const Index target_variables_number = data_set.get_target_variables_number();
            
// Neural Network
const Index hidden_neurons_number = 6;
NeuralNetwork neural_network(NeuralNetwork::ProjectType::TextClassification,
    {words_number , hidden_neurons_number, target_variables_number});
            
// Training Strategy
TrainingStrategy training_strategy(&neural_network, &data_set);
training_strategy.set_loss_method(TrainingStrategy::LossMethod::CROSS_ENTROPY_ERROR);
training_strategy.set_optimization_method(TrainingStrategy::OptimizationMethod::ADAPTIVE_MOMENT_ESTIMATION);
training_strategy.perform_training();
            
// Testing Analysis
TestingAnalysis testing_analysis(&neural_network, &data_set);
testing_analysis.print_binary_classification_tests();
            
// Model deployment
string review_1 = "Highly recommend for any one who has a bluetooth phone.";
Tensor<type,1> processed_review_1 = data_set.sentence_to_data(review_1);
string review_2 = "You have to hold the phone at a particular angle for the other party to hear you clearly.";
Tensor<type,1> processed_review_2 = data_set.sentence_to_data(review_2);
Tensor<type,2> input_data(2, words_number);
for(Index i = 0; i < words_number; i++)
{
  input_data(0,i) = processed_review_1(i);
  input_data(1,i) = processed_review_2(i);
}
Tensor<type,2> outputs = neural_network.calculate_outputs(input_data);
            
// Save results
neural_network.save_expression_c("../data/amazon_reviews.txt");
neural_network.save_expression_python("../data/amazon_reviews.py");

This code can be exported to your C++ project.
