I can't seem to figure out how to get this print out all the words including the duplicates

Question

I am trying to get this to print out all the words that are on a text file in ascending order. When I run it, it prints out in ascending order, but it only prints one occurrence of the word. I want it to print out every occurrence of the word(duplicates wanted). I am not sure what I'm doing wrong. Also I would like it to only print out the words and not the punctuation marks that are in the text file. I know I need to use the "split", just not sure how to properly use it. I've worked with it once before but can not remember how to apply it here.

This is the code I have so far:

public class DisplayingWords {

public static void main(String[] args) throws 
        FileNotFoundException, IOException 
{
    Scanner ci = new Scanner(System.in);
    System.out.print("Please enter a text file to open: ");
    String filename = ci.next();
    System.out.println("");

    File file = new File(filename);
    BufferedReader br = new BufferedReader(new FileReader(file));

    StringBuilder sb = new StringBuilder();
    String str;
    while((str = br.readLine())!= null)

    {
/*
 * This is where i seem to be having my problems.
 * I have only ever used a split once before and can not 
 * remember how to properly use it. 
 * i am trying to get the print out to avoid printing out 
 * all the punctuation marks and have only the words
 */

      //  String[] str = str.split("[ 
	
.,;:!?(){}]");
        str.split("[ 
	
.,;:!?(){}]");
        sb.append(str);
        sb.append(" ");
        System.out.println(str);
    }

    ArrayList text = new ArrayList<>();
    StringTokenizer st = new StringTokenizer(sb.toString().toLowerCase());
            while(st.hasMoreTokens()) 
            {
                String s = st.nextToken();
                text.add(s);
            }

            System.out.println("
" + "Words Printed out in Ascending "
                                + "(alphabetical) order: " + "
");

            HashSet set = new HashSet<>(text);
            List arrayList = new ArrayList<>(set);
            Collections.sort(arrayList);
            for (Object ob : arrayList)
                System.out.println("	" + ob.toString());
    }
}

Sam I am says Reinstate Monica · Accepted Answer

your duplicates are probably being stripped out here

HashSet set = new HashSet<>(text);

a set generally does not contain duplicates, so I'd just sort your text array list

Collections.sort(text);
for (Object ob : text)
    System.out.println("	" + ob.toString());

I can't seem to figure out how to get this print out all the words including the duplicates

Answers (2)

Related Questions

I can&#39;t seem to figure out how to get this print out all the words including the duplicates

Answers (2)

Related Questions

I can't seem to figure out how to get this print out all the words including the duplicates