(Java) How do you check a string for two or more digit integers using .matches()?

Question

The objective of this program is to return the atomic mass from a given formula (as a String), using a CSV file that contains elements and their atomic masses. My code for this particular problem is this: I have another method &#8220;fileReader&#8221; that takes the data from the mile and returns a map with t…

Accepted Answer

I’ve reimplemented your method getAtom(). The main change that has been done to it is that instead of processing a formula character by character it splits the formula into chunks that represent either uppercase letters, combinations of an uppercase and a lowercase letter, or number.That’s the code for it:String[] elementsAndIndices = formula.split("(?<=\p{Lower})(?=\p{Upper})|(?<=\p{Upper})(?=\p{Upper})|(?<=\D)(?=\d)|(?<=\d)(?=\D)");Let’s examine what’s going on here:d – a digit: [0-9];D – a non-digit: [^0-9];p{Lower} – a lower-case alphabetic character: [a-z];p{Upper} – an upper-case alphabetic character:[A-Z];p{Alpha} – an alphabetic character:[p{Lower}p{Upper}];Special constructs that start with a question mark are called lookbehind (?<=\p{Lower}) and lookahead (?=\p{Upper}). Both match a zero-length string that allows to split the formula in a manner that I described above without losing any symbol (you can read more about them here).Meanings of lookbehind and lookbehind combinations that are used to split the formula:(?<=p{Lower})(?=p{Upper}) – matches a zero-length string between a lower-case character p{Lower} and an upper-case character p{Upper};(?<=p{Upper})(?=p{Upper}) – matches a zero-length string on the border between two upper-case characters p{Upper};(?<=D)(?=d) – matches a zero-length string between a non-digit D and a digit d;(?<=d)(?=D) – matches a zero-length string between a digit d and a non-digit D.Method merge() is used to update the map. It takes three arguments: key, value and remapping function. If the given key is absent (or this key is associated with null value) then this key will be associated with the given value. The remapping function, which is used to merge two values will be evaluated only if the provided key is already present and the value for it is not null.elements.merge(elementsAndIndices[i], 1, Integer::sum);The line of code shown above associates value 1 with a key elementsAndIndices[i] if it’s not present in the map otherwise, it’ll merge the existing value with 1 producing a sum. Method reference Integer::sum is an equivalent of lambda (val1, val2) -> val1 + val2;. public static Map getAtoms(String formula) { Map elements = new LinkedHashMap<>(); String[] elementsAndIndices = formula.split("(?<=\p{Lower})(?=\p{Upper})|(?<=\p{Upper})(?=\p{Upper})|(?<=\D)(?=\d)|(?<=\d)(?=\D)"); for (int i = 0; i < elementsAndIndices.length; i++) { if (elementsAndIndices[i].matches("\p{Alpha}+")) { if (i == elementsAndIndices.length - 1 || !elementsAndIndices[i + 1].matches("\d+")) { elements.merge(elementsAndIndices[i], 1, Integer::sum); } else { elements.merge(elementsAndIndices[i], Integer.parseInt(elementsAndIndices[i + 1]), Integer::sum); i++; } } } return elements; } public static void main(String[] args) { System.out.println(getAtoms("C6H12O6")); System.out.println(getAtoms("NH3")); System.out.println(getAtoms("RuSH2112")); }output{C=6, H=12, O=6}{N=1, H=3}{Ru=1, S=1, H=2112}

Advertisement

Answer