Lowercase all HashMap keys

前端 未结 4 1242
夕颜
夕颜 2021-02-12 14:17

I \'ve run into a scenario where I want to lowercase all the keys of a HashMap (don\'t ask why, I just have to do this). The HashMap has some millions of entries.

At fir

相关标签:
4条回答
  • 2021-02-12 14:23

    You cannot remove the entry while iterating over the map. You will have a ConcurentModificationException if you try to do this.

    As the issue is an OutOfMemoryError, not a performance error, using parallel stream will not help either.

    Despite some task on the Stream API will be done lately, this will still lead to have two maps in memory at some point so you will still have the issue.

    To workaround it, I only saw two ways :

    • Give more memory to your process (by increasing -Xmx on the Java command line). Memory is cheap these days ;)
    • Split the map and work in chunks : for example you divide the size of the map by ten and you process one chunck at a time and delete the processed entries before processing the new chunk. By this instead of having two times the map in memory you will just have 1.1 times the map.

    For the split algorithm, you can try someting like this using the Stream API :

    Map<String, String> toMap = new HashMap<>();            
    int chunk = fromMap.size() / 10;
    for(int i = 1; i<= 10; i++){
        //process the chunk
        List<Entry<String, String>> subEntries = fromMap.entrySet().stream().limit(chunk)
            .collect(Collectors.toList());  
    
        for(Entry<String, String> entry : subEntries){
            toMap.put(entry.getKey().toLowerCase(), entry.getValue());
            fromMap.remove(entry.getKey());
        }
    }
    
    0 讨论(0)
  • 2021-02-12 14:25

    Not sure about the memory footprint. If using Kotlin, you can try the following.

    val lowerCaseMap = myMap.mapKeys { it.key.toLowerCase() }
    

    https://kotlinlang.org/api/latest/jvm/stdlib/kotlin.collections/map-keys.html

    0 讨论(0)
  • 2021-02-12 14:27

    the concerns in the above answers are correct and you might need to reconsider changing the data structure you are using.

    for me, I had a simple map I needed to change its keys to lower case

    take a look at my snippet, its a trivial solution and bad at performance

    private void convertAllFilterKeysToLowerCase() {
        HashSet keysToRemove = new HashSet();
        getFilters().keySet().forEach(o -> {
            if(!o.equals(((String) o).toLowerCase()))
                keysToRemove.add(o);
        });
        keysToRemove.forEach(o -> getFilters().put(((String) o).toLowerCase(), getFilters().remove(o)));
    }
    
    0 讨论(0)
  • 2021-02-12 14:33

    Instead of using HashMap, you could try using a TreeMap with case-insensitive ordering. This would avoid the need to create a lower-case version of each key:

    Map<String, Long> map = new TreeMap<>(String.CASE_INSENSITIVE_ORDER);
    map.putAll(myMap);
    

    Once you've constructed this map, put() and get() will behave case-insensitively, so you can save and fetch values using all-lowercase keys. Iterating over keys will return them in their original, possibly upper-case forms.

    Here are some similar questions:

    • Case insensitive string as HashMap key
    • Is there a good way to have a Map<String, ?> get and put ignoring case?
    0 讨论(0)
提交回复
热议问题