Question 1

What is extractive summarization and how does it differ from AI summarization?

Accepted Answer

Extractive summarization selects and returns actual sentences from the original text that are scored as most important. The output contains only words that appeared in the input — there is no paraphrasing or generation of new content. AI-based (abstractive) summarization, by contrast, generates new sentences that may paraphrase or combine information from multiple parts of the text. Extractive summarization is deterministic (same input always gives the same output), does not hallucinate, and requires no cloud API or subscription.

Question 2

How does the word frequency scoring algorithm work?

Accepted Answer

The algorithm: (1) splits text into sentences on punctuation boundaries, (2) builds a word frequency map from all words in the text, excluding common stop words in both English and Korean, (3) normalizes each word frequency by dividing by the highest frequency in the document, (4) scores each sentence as the mean of the normalized frequencies of its content words, (5) applies a position boost — sentences that appear earlier get up to 20% additional score — to reflect that topic sentences and key information often appear at the beginning of paragraphs and articles, (6) selects the top N sentences and returns them in their original document order.

Question 3

Why are some important sentences not included in the summary?

Accepted Answer

The algorithm scores sentences based on how many high-frequency (important) words they contain relative to their length. Sentences that introduce new terminology, provide context, or use unique words may receive low scores even if they are subjectively important. The algorithm is a heuristic and works best on well-structured text where important concepts are repeated across multiple sentences. For texts with very uniform word frequency distributions, the results may be less distinctive.

Question 4

What is the minimum text length required?

Accepted Answer

The tool requires at least 50 characters of input text. This minimum ensures there is enough content to meaningfully split into sentences and score. Very short texts (under 50 characters) are typically a single sentence and cannot be summarized — the tool will display an error message in this case.

Question 5

Does the tool support languages other than English and Korean?

Accepted Answer

The tool will process any text that uses standard punctuation (periods, exclamation marks, question marks) as sentence boundaries. The stop word lists cover English and Korean specifically, so other languages will still produce output but may not filter stop words as effectively. For European languages that share similar stop words with English (articles, prepositions, conjunctions), the quality should still be reasonable.

Question 6

Why does the summary sometimes include fewer sentences than I selected?

Accepted Answer

If the input text has fewer total sentences than the requested number (e.g., the text has 3 sentences but you requested 5), the tool returns all available sentences as the summary. The output cannot contain more sentences than exist in the input.

Question 7

Is my text sent to any server or AI API?

Accepted Answer

No. All text processing happens entirely within your web browser. The tokenization, word frequency calculation, sentence scoring, and output generation are all performed by JavaScript running locally on your device. Your text never leaves your browser, is never stored, and is never sent to any server, AI API, or third party. This makes the tool suitable for summarizing confidential documents, internal reports, or sensitive research.

Question 8

How should I choose the number of summary sentences?

Accepted Answer

As a general guideline: 3 sentences works well for articles up to 500 words (capturing the main point and two supporting ideas), 5 sentences is suitable for 500-1500 word articles, 7 sentences handles medium-length documents (1500-3000 words), and 10 sentences is appropriate for longer documents or when you need a more complete overview. The optimal setting depends on how much information density the source text contains and how much of the detail you want to preserve.

Text Summarizer

Usage Guide

About Text Summarizer

Key Features

Frequently Asked Questions