How to parse in Java?

Tutorial with code examples about tools to generate a parser in Java
15 August 2017   1582

There are three ways to parse a language or a document from Java:

  • with an existing library supporting specific language: for example a library to parse XML
  • creating your own parser
  • a tool or library to generate a parser

Use An Existing Library

You can use an existing library if you need to parse well known and supported language, like XML or HTML. A high quality library also include API for building and modifing documents in the language. This is what you can get from a basic parser. The main problem is that ther are not so much  quality libraries and the they support only the most popular languages. 

Building Your Own Custom Parser By Hand

You can choose this option if you have specific needs. If the language you need to parse cannot be parsed with traditional parser generators, or you have specific requirements that you cannot satisfy using a typical parser generator. Or maybe you need the best possible performance or a deep connection between components.

A Tool Or Library To Generate A Parser

The third option is a best one for all other situations. It is the most flexible option and has the shorter development time. Learn more what tool or library to chose to generate a parser from this tutorial.

Tiobe February 2019 Index Released

Groove language is in top 20 again; it's popularity increased due to Gradle; Java is 1st
11 February 2019   334

In February, search engine statistics showed that the dynamically typed Groovy language is gaining popularity again. As analysts of TIOBE noted, it was already in the top twenty of the TIOBE index in 2016 on the wave of interest in the Jenkins system for continuous integration. Now it is pushed forward by the Gradle assembly automation system.

TIOBE Februry 2019 Index
TIOBE Februry 2019 Index

More analysts noticed that the language Hack this month entered the top 50, and TypeScript left it (in their opinion, temporarily). Since February last year, Python and C ++ swapped places, and again topped the list with Java and C.