AroundAround
Sign in Sign up
Menu
  • Home
    • Home
  • Blog
    Technology
    All Technology Software Engineering Web Applications Java Spring Framework Spring Boot AWS
    All Posts
    Smash Posts Transportation Posts Driving Posts Multi Location Post
    Transport
    Driving Articles Driving License Resources Refresh Drive Success Stories Special Articles
    Research
    Popular lists Weblogs Tutorials
    News
    Education Health Sports
    Traffic Signs
    Dubai Traffic Signs Qatar Traffic Signs Abu Dhabi Traffic Signs Saudi Arabia Traffic Signs Kuwait Traffic Signs Kerala Traffic Signs
  • Insights
    Interview Series
    All Interview Series Java Questions Spring Questions Microservices Questions Database Questions
    Lifestyle
    Umrah Articles Dubai UAE visa 180 days calculator Qatar residence visa 180 days calculator Prayer Time Malappuram Sqm to Cent calculator Kerala
    City Explorer
    Sulthan Bathery
  • Mock Tests
    Driving
    Test Home
    Dubai
    RTA Theory Test Parking Knowledge Test Deep-dive Assessment Test
    Abu Dhabi
    RTA Driving Theory Test Deep-dive Assessment Test Saudi Arabia Computer Test
    Other Regions
    Qatar Driving Theory Test Kuwait Driving Theory Test Ajman RTA Theory Test Sharjah RTA Theory Test Bahrain Driving Theory Test Learners License Test - Kerala
    Education
    Kindergarten School
  • Course & Videos
    • CoursesEnroll today
    • Videos20+Watch & Subscribe
  • Questions and Tags
    Topics
    Don't miss Million dollar questions Million Views Day to day life Interview Junky Trendy questions Theory wizard
    Technology
    Java Spring AWS
    Find the differences
    All Find the differences List
    Tags
    macOS AWS Spring
  • About Us
    • Privacy Policy
    • Contact
    • Terms & Conditions
    • Cancellation & Refund Policy
    • Shipping & Delivery Policy
  1. Home
  2. java
  3. Post

Sanitize HTML from String

Emma Brown
admin
#java #String handling
Share post:
Share

Tired of regex failing you when stripping HTML from Java strings? There's a cleaner, more robust way. Ready to ditch brittle `replaceAll()`?

How can I reliably remove HTML tags from a Java String, avoiding issues with entities like `&` and preventing unintended removal of content within tags?

Solution in a Nutshell

Removing HTML tags in Java/Spring Boot is straightforward using regular expressions or dedicated libraries. For simple scenarios, a regex suffices:

import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class HtmlTagRemover {
    public static String removeHtmlTags(String html) {
        Pattern pattern = Pattern.compile("<.*?>"); // Matches any tag
        Matcher matcher = pattern.matcher(html);
        return matcher.replaceAll("");
    }

    public static void main(String[] args) {
        String html = "<p>This is <b>some</b> text.</p>";
        String text = removeHtmlTags(html);
        System.out.println(text); // Output: This is some text.
    }
}

This uses a simple regex "<.*?>". For more robust handling of edge cases (e.g., nested tags, malformed HTML), consider using a dedicated HTML parser like Jsoup:

import org.jsoup.Jsoup;

public class HtmlTagRemoverJsoup {
    public static String removeHtmlTags(String html) {
        return Jsoup.parse(html).text();
    }

    public static void main(String[] args) {
        String html = "<p>This is <b>some</b> text.</p>";
        String text = removeHtmlTags(html);
        System.out.println(text); // Output: This is some text.
    }
}

Remember to add the Jsoup dependency to your pom.xml:

<dependency>
    <groupId>org.jsoup</groupId>
    <artifactId>jsoup</artifactId>
    <version>1.15.4</version> </dependency>

Jsoup offers better handling of complex HTML and avoids potential regex pitfalls. Choose the method best suited to your needs; for simple cases, regex is sufficient; for complex or potentially malicious HTML, Jsoup is recommended. Always sanitize user input before processing to prevent XSS vulnerabilities.

Sidebar

Search

Tags

#java #String handling

Trending posts

Post

How to iterate through the HashMap in Java?

Post

How Can I convert a String to an int in Java?

Post

How to get a key from the value in Java HashMap

Post

In javascript how to replace all occurrences of a String?

  • Contact Us
  • Privacy Policy

© Copyright - smashplus 2013-25.