Implementation of Logistic regression with Gradient Descent in Java

Question

I have implemented Logistic Regression with Gradient Descent in Java. It doesn't seem to work well (It does not classify records properly; the probability of y=1 is a lot.) I don't know whether my implementation is correct.I have gone through the code several times and i am unable to find any bug. I have been following Andrew Ng's tutorials on Machine learning on Course Era. My Java implementation has 3 classes. namely :

DataSet.java : To read the data set
Instance.java : Has two members : 1. double[ ] x and 2. double label
Logistic.java : This is the main class that implements Logistic Regression with Gradient Descent.

This is my cost function:

J(Θ) = (- 1/m ) [Σ^m_i=1 y⁽ⁱ⁾ log( h_Θ( x⁽ⁱ⁾ ) ) + (1 - y⁽ⁱ⁾) log(1 - h_Θ (x⁽ⁱ⁾) )]

For the above Cost function, this is my Gradient Descent algorithm:

Repeat (

Θ_j := Θ_j - α Σ^m_i=1 ( h_Θ( x⁽ⁱ⁾) - y⁽ⁱ⁾ ) x⁽ⁱ⁾_j

(Simultaneously update all Θ_j )

)

import java.io.FileNotFoundException;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class Logistic {

    /** the learning rate */
    private double alpha;

    /** the weight to learn */
    private double[] theta;

    /** the number of iterations */
    private int ITERATIONS = 3000;

    public Logistic(int n) {
        this.alpha = 0.0001;
        theta = new double[n];
    }

    private double sigmoid(double z) {
        return (1 / (1 + Math.exp(-z)));
    }

    public void train(List instances) {

    double[] temp = new double[3];

    //Gradient Descent algorithm for minimizing theta
    for(int i=1;i<=ITERATIONS;i++)
    {
       for(int j=0;j<3;j++)
       {      
        temp[j]=theta[j] - (alpha * sum(j,instances));
       }

       //simulataneous updates of theta  
       for(int j=0;j<3;j++)
       {
         theta[j] = temp[j];
       }
        System.out.println(Arrays.toString(theta));
    }

    }

    private double sum(int j,List instances)
    {
        double[] x;
        double prediction,sum=0,y;


       for(int i=0;i instances = DataSet.readDataSet("data.txt");
      // 3 : number of theta parameters corresponding to the features x 
      // x0 is always 1   
        Logistic logistic = new Logistic(3);
        logistic.train(instances);

        //Test data
        double[]x = new double[3];
        x[0]=1;
        x[1]=45;
        x[2] = 85;

        System.out.println("Prob: "+logistic.classify(x));


    }
}

Can anyone tell me what am I doing wrong? Thanks in advance! :)

Implementation of Logistic regression with Gradient Descent in Java

J(Θ) = (- 1/m ) [Σ^m_i=1 y⁽ⁱ⁾ log( h_Θ( x⁽ⁱ⁾ ) ) + (1 - y⁽ⁱ⁾) log(1 - h_Θ (x⁽ⁱ⁾) )]

Repeat (

Θ_j := Θ_j - α Σ^m_i=1 ( h_Θ( x⁽ⁱ⁾) - y⁽ⁱ⁾ ) x⁽ⁱ⁾_j

)

Answers (1)

Related Questions

Implementation of Logistic regression with Gradient Descent in Java

J(Θ) = (- 1/m ) [Σmi=1 y(i) log( hΘ( x(i) ) ) + (1 - y(i) ) log(1 - hΘ (x(i)) )]

Repeat (

Θj := Θj - α Σmi=1 ( hΘ( x(i)) - y(i) ) x(i)j

)

Answers (1)

Related Questions

J(Θ) = (- 1/m ) [Σ^m_i=1 y⁽ⁱ⁾ log( h_Θ( x⁽ⁱ⁾ ) ) + (1 - y⁽ⁱ⁾) log(1 - h_Θ (x⁽ⁱ⁾) )]

Θ_j := Θ_j - α Σ^m_i=1 ( h_Θ( x⁽ⁱ⁾) - y⁽ⁱ⁾ ) x⁽ⁱ⁾_j