Ask Experts Questions for FREE Help !
Ask
    ubharedev's Avatar
    ubharedev Posts: 134, Reputation: 1
    Junior Member
     
    #1

    Aug 31, 2010, 05:09 AM
    Auto Summarisation or text summarisation
    How can I developed an application for text summarisation in java.
    What algorithm should I use.
    Is it easy to work with Eclipse or netbeans for this project.
    Please help.
    It is for my college project.

    Thanks in advanced.
    codeguru's Avatar
    codeguru Posts: 3, Reputation: 1
    New Member
     
    #2

    Sep 21, 2010, 03:09 AM
    Hi,
    Please explain what is the text summarisation you are looking for ? An example will help.
    I would recommend Eclipse, it is much easier compared to Netbeans, I had used Eclipse for many large scale software development.


    ubharedev's Avatar
    ubharedev Posts: 134, Reputation: 1
    Junior Member
     
    #3

    Sep 23, 2010, 04:27 AM
    Quote Originally Posted by codeguru View Post
    Hi,
    Please explain what is the text summarisation you are looking for ? an example will help.
    I would recommend Eclipse, it is much easier compared to Netbeans, I had used Eclipse for many large scale software development.
    Thanks for reply.

    I want to develop a tool that takes input text from file and then
    It makes summary based on the given words by user.
    User will give keywords to summarize a document.
    Keywords will be searched in whole document then depends on that
    Sentences which contains that keywords will be picked up.
    That sentences we will be our output.
    How can I do that.

    Main possibility is that the same keyword can be found any number of
    Times.How can I tackle that.
    I will go with Eclipse doesn't matter.

    Please reply.
    codeguru's Avatar
    codeguru Posts: 3, Reputation: 1
    New Member
     
    #4

    Sep 23, 2010, 08:42 PM
    Assume you have two files content.txt having the text to summarize and keyword.txt having the keyword
    Use BufferedReader to read the keywords into memory array/ list
    Use BufferedReader to read each line from content.txt into String.
    For each keyword Use matches method in the String class to check whether the keyword is present in the line
    If present output the line and also increment a counter for the keyword
    Continue with next line
    At the end output how many times each keyword is found.

    If you have strict performance in mind this will need some tuning.
    Let me know if you need sample code
    HTML Code:
    <script type="text/javascript"><!--
    google_ad_client = "pub-7374813015672333";
    google_ad_width = 468;
    google_ad_height = 60;
    google_ad_format = "468x60_as";
    google_ad_type = "text_image";
    google_ad_channel = "";
    google_color_border = "336699";
    google_color_bg = "FFFFCC";
    google_color_link = "0000FF";
    google_color_text = "000000";
    google_color_url = "008000";
    google_ui_features = "rc:10";
    //-->
    </script> 
    <script type="text/javascript"
    
      src="http://pagead2.googlesyndication.com/pagead/show_ads.js"> 
    </script>


    codeguru's Avatar
    codeguru Posts: 3, Reputation: 1
    New Member
     
    #5

    Sep 23, 2010, 08:44 PM
    Assume you have two files content.txt having the text to summarize and keyword.txt having the keyword
    Use BufferedReader to read the keywords into memory array/ list
    Use BufferedReader to read each line from content.txt into String.
    For each keyword Use matches method in the String class to check whether the keyword is present in the line
    if present output the line and also increment a counter for the keyword
    Continue with next line
    at the end output how many times each keyword is found.

    If you have strict performance in mind this will need some tuning.
    Let me know if you need sample code
    <script type="text/javascript"><!--
    google_ad_client = "pub-7374813015672333";
    google_ad_width = 468;
    google_ad_height = 60;
    google_ad_format = "468x60_as";
    google_ad_type = "text_image";
    google_ad_channel = "";
    google_color_border = "336699";
    google_color_bg = "FFFFCC";
    google_color_link = "0000FF";
    google_color_text = "000000";
    google_color_url = "008000";
    google_ui_features = "rc:10";
    //-->
    </script>
    <script type=";
    //-->
    </script>
    <script type="

    src="http://pagead2.googlesyndication.com/pagead/show_ads.js">
    </script>
    ubharedev's Avatar
    ubharedev Posts: 134, Reputation: 1
    Junior Member
     
    #6

    Sep 24, 2010, 01:38 AM
    I want to just picked up the sentences which contains the keywords I searched for in file.
    But possibility is that search can give more than two times for same keyword.
    So how can I differentiate between the lines which contains the keyword.

    Example:
    Note: Suppose this is main text file contains the following sentences.

    "This is the program for auto summarization in java.
    Java is simple amazing language.
    Need to do project on java."

    If I searched for keyword "java" than it gives me three times from file.
    Than how can I make my program more intelligent so that it would picked up the sentence which has more weighted or importance.
    I heard about the scoring algorithm.But I don't know about it.
    Only I know that its gives score to the keyword searched higher the score of the searched keyword.
    It will only picked that sentence from all searched sentence.
    Let me know if you can do something in this or any other alternative methods.

Not your question? Ask your question View similar questions

 

Question Tools Search this Question
Search this Question:

Advanced Search


Check out some similar questions!

Excel 15 digit issue, tried converting to text, text to column feature negates fix [ 6 Answers ]

I have the following numbers that exceed 15 characters that needs to be split into its own columns. Down the road, there would be thousands of such rows of data with the first couple set of unique numbers. 890432453253208820,5004500558,05CC,1,0,0,0,0,0,0, 0000,5.0000,2007-01-11...

How to set up Word 2007 to auto-fill text? [ 3 Answers ]

I have search all over just to add a simple field or form to a template, that I can reuse in the documents I create. What I really want is a place-holder text called "Purchaser Name", and another called "Client Name", so that in a new document I could just click anywhere on "Purchaser Name" and...

How can I get a 9-digit numeric datapoint to format as text, and link up to text [ 2 Answers ]

How can I get a 9-digit numeric datapoint to format as text, and link up to text? I was able to convert a column of 9-digit Social Security Numbers to 9-digit text, but they don't link up to =if statetments as I try to match them up. Thanks for your help

Convert image (pdf, typewriter text) to text file [ 21 Answers ]

I've played with 3 different OCR apps (including one that seems to be tops (Abbyy Fine Reader Pro) and find it too much. Easy to use but far too time consuming for my skills with it. See the attached pdf (38 pages of typed text). I have it and 5 others that need converted but am finding it too...


View more questions Search