Java 8 Iterable.forEach() vs foreach loop

前端 未结 8 1156
暗喜
暗喜 2020-11-22 14:36

Which of the following is better practice in Java 8?

Java 8:

joins.forEach(join -> mIrc.join(mSession, join));

Java 7:



        
相关标签:
8条回答
  • 2020-11-22 15:06

    TL;DR: List.stream().forEach() was the fastest.

    I felt I should add my results from benchmarking iteration. I took a very simple approach (no benchmarking frameworks) and benchmarked 5 different methods:

    1. classic for
    2. classic foreach
    3. List.forEach()
    4. List.stream().forEach()
    5. List.parallelStream().forEach

    the testing procedure and parameters

    private List<Integer> list;
    private final int size = 1_000_000;
    
    public MyClass(){
        list = new ArrayList<>();
        Random rand = new Random();
        for (int i = 0; i < size; ++i) {
            list.add(rand.nextInt(size * 50));
        }    
    }
    private void doIt(Integer i) {
        i *= 2; //so it won't get JITed out
    }
    

    The list in this class shall be iterated over and have some doIt(Integer i) applied to all it's members, each time via a different method. in the Main class I run the tested method three times to warm up the JVM. I then run the test method 1000 times summing the time it takes for each iteration method (using System.nanoTime()). After that's done i divide that sum by 1000 and that's the result, average time. example:

    myClass.fored();
    myClass.fored();
    myClass.fored();
    for (int i = 0; i < reps; ++i) {
        begin = System.nanoTime();
        myClass.fored();
        end = System.nanoTime();
        nanoSum += end - begin;
    }
    System.out.println(nanoSum / reps);
    

    I ran this on a i5 4 core CPU, with java version 1.8.0_05

    classic for

    for(int i = 0, l = list.size(); i < l; ++i) {
        doIt(list.get(i));
    }
    

    execution time: 4.21 ms

    classic foreach

    for(Integer i : list) {
        doIt(i);
    }
    

    execution time: 5.95 ms

    List.forEach()

    list.forEach((i) -> doIt(i));
    

    execution time: 3.11 ms

    List.stream().forEach()

    list.stream().forEach((i) -> doIt(i));
    

    execution time: 2.79 ms

    List.parallelStream().forEach

    list.parallelStream().forEach((i) -> doIt(i));
    

    execution time: 3.6 ms

    0 讨论(0)
  • 2020-11-22 15:10

    The better practice is to use for-each. Besides violating the Keep It Simple, Stupid principle, the new-fangled forEach() has at least the following deficiencies:

    • Can't use non-final variables. So, code like the following can't be turned into a forEach lambda:
    Object prev = null;
    for(Object curr : list)
    {
        if( prev != null )
            foo(prev, curr);
        prev = curr;
    }
    
    • Can't handle checked exceptions. Lambdas aren't actually forbidden from throwing checked exceptions, but common functional interfaces like Consumer don't declare any. Therefore, any code that throws checked exceptions must wrap them in try-catch or Throwables.propagate(). But even if you do that, it's not always clear what happens to the thrown exception. It could get swallowed somewhere in the guts of forEach()

    • Limited flow-control. A return in a lambda equals a continue in a for-each, but there is no equivalent to a break. It's also difficult to do things like return values, short circuit, or set flags (which would have alleviated things a bit, if it wasn't a violation of the no non-final variables rule). "This is not just an optimization, but critical when you consider that some sequences (like reading the lines in a file) may have side-effects, or you may have an infinite sequence."

    • Might execute in parallel, which is a horrible, horrible thing for all but the 0.1% of your code that needs to be optimized. Any parallel code has to be thought through (even if it doesn't use locks, volatiles, and other particularly nasty aspects of traditional multi-threaded execution). Any bug will be tough to find.

    • Might hurt performance, because the JIT can't optimize forEach()+lambda to the same extent as plain loops, especially now that lambdas are new. By "optimization" I do not mean the overhead of calling lambdas (which is small), but to the sophisticated analysis and transformation that the modern JIT compiler performs on running code.

    • If you do need parallelism, it is probably much faster and not much more difficult to use an ExecutorService. Streams are both automagical (read: don't know much about your problem) and use a specialized (read: inefficient for the general case) parallelization strategy (fork-join recursive decomposition).

    • Makes debugging more confusing, because of the nested call hierarchy and, god forbid, parallel execution. The debugger may have issues displaying variables from the surrounding code, and things like step-through may not work as expected.

    • Streams in general are more difficult to code, read, and debug. Actually, this is true of complex "fluent" APIs in general. The combination of complex single statements, heavy use of generics, and lack of intermediate variables conspire to produce confusing error messages and frustrate debugging. Instead of "this method doesn't have an overload for type X" you get an error message closer to "somewhere you messed up the types, but we don't know where or how." Similarly, you can't step through and examine things in a debugger as easily as when the code is broken into multiple statements, and intermediate values are saved to variables. Finally, reading the code and understanding the types and behavior at each stage of execution may be non-trivial.

    • Sticks out like a sore thumb. The Java language already has the for-each statement. Why replace it with a function call? Why encourage hiding side-effects somewhere in expressions? Why encourage unwieldy one-liners? Mixing regular for-each and new forEach willy-nilly is bad style. Code should speak in idioms (patterns that are quick to comprehend due to their repetition), and the fewer idioms are used the clearer the code is and less time is spent deciding which idiom to use (a big time-drain for perfectionists like myself!).

    As you can see, I'm not a big fan of the forEach() except in cases when it makes sense.

    Particularly offensive to me is the fact that Stream does not implement Iterable (despite actually having method iterator) and cannot be used in a for-each, only with a forEach(). I recommend casting Streams into Iterables with (Iterable<T>)stream::iterator. A better alternative is to use StreamEx which fixes a number of Stream API problems, including implementing Iterable.

    That said, forEach() is useful for the following:

    • Atomically iterating over a synchronized list. Prior to this, a list generated with Collections.synchronizedList() was atomic with respect to things like get or set, but was not thread-safe when iterating.

    • Parallel execution (using an appropriate parallel stream). This saves you a few lines of code vs using an ExecutorService, if your problem matches the performance assumptions built into Streams and Spliterators.

    • Specific containers which, like the synchronized list, benefit from being in control of iteration (although this is largely theoretical unless people can bring up more examples)

    • Calling a single function more cleanly by using forEach() and a method reference argument (ie, list.forEach (obj::someMethod)). However, keep in mind the points on checked exceptions, more difficult debugging, and reducing the number of idioms you use when writing code.

    Articles I used for reference:

    • Everything about Java 8
    • Iteration Inside and Out (as pointed out by another poster)

    EDIT: Looks like some of the original proposals for lambdas (such as http://www.javac.info/closures-v06a.html Google Cache) solved some of the issues I mentioned (while adding their own complications, of course).

    0 讨论(0)
  • 2020-11-22 15:13

    forEach() can be implemented to be faster than for-each loop, because the iterable knows the best way to iterate its elements, as opposed to the standard iterator way. So the difference is loop internally or loop externally.

    For example ArrayList.forEach(action) may be simply implemented as

    for(int i=0; i<size; i++)
        action.accept(elements[i])
    

    as opposed to the for-each loop which requires a lot of scaffolding

    Iterator iter = list.iterator();
    while(iter.hasNext())
        Object next = iter.next();
        do something with `next`
    

    However, we also need to account for two overhead costs by using forEach(), one is making the lambda object, the other is invoking the lambda method. They are probably not significant.

    see also http://journal.stuffwithstuff.com/2013/01/13/iteration-inside-and-out/ for comparing internal/external iterations for different use cases.

    0 讨论(0)
  • 2020-11-22 15:14

    One of most upleasing functional forEach's limitations is lack of checked exceptions support.

    One possible workaround is to replace terminal forEach with plain old foreach loop:

        Stream<String> stream = Stream.of("", "1", "2", "3").filter(s -> !s.isEmpty());
        Iterable<String> iterable = stream::iterator;
        for (String s : iterable) {
            fileWriter.append(s);
        }
    

    Here is list of most popular questions with other workarounds on checked exception handling within lambdas and streams:

    Java 8 Lambda function that throws exception?

    Java 8: Lambda-Streams, Filter by Method with Exception

    How can I throw CHECKED exceptions from inside Java 8 streams?

    Java 8: Mandatory checked exceptions handling in lambda expressions. Why mandatory, not optional?

    0 讨论(0)
  • 2020-11-22 15:20

    The advantage comes into account when the operations can be executed in parallel. (See http://java.dzone.com/articles/devoxx-2012-java-8-lambda-and - the section about internal and external iteration)

    • The main advantage from my point of view is that the implementation of what is to be done within the loop can be defined without having to decide if it will be executed in parallel or sequential

    • If you want your loop to be executed in parallel you could simply write

       joins.parallelStream().forEach(join -> mIrc.join(mSession, join));
      

      You will have to write some extra code for thread handling etc.

    Note: For my answer I assumed joins implementing the java.util.Stream interface. If joins implements only the java.util.Iterable interface this is no longer true.

    0 讨论(0)
  • 2020-11-22 15:22

    The advantage of Java 1.8 forEach method over 1.7 Enhanced for loop is that while writing code you can focus on business logic only.

    forEach method takes java.util.function.Consumer object as an argument, so It helps in having our business logic at a separate location that you can reuse it anytime.

    Have look at below snippet,

    • Here I have created new Class that will override accept class method from Consumer Class, where you can add additional functionility, More than Iteration..!!!!!!

      class MyConsumer implements Consumer<Integer>{
      
          @Override
          public void accept(Integer o) {
              System.out.println("Here you can also add your business logic that will work with Iteration and you can reuse it."+o);
          }
      }
      
      public class ForEachConsumer {
      
          public static void main(String[] args) {
      
              // Creating simple ArrayList.
              ArrayList<Integer> aList = new ArrayList<>();
              for(int i=1;i<=10;i++) aList.add(i);
      
              //Calling forEach with customized Iterator.
              MyConsumer consumer = new MyConsumer();
              aList.forEach(consumer);
      
      
              // Using Lambda Expression for Consumer. (Functional Interface) 
              Consumer<Integer> lambda = (Integer o) ->{
                  System.out.println("Using Lambda Expression to iterate and do something else(BI).. "+o);
              };
              aList.forEach(lambda);
      
              // Using Anonymous Inner Class.
              aList.forEach(new Consumer<Integer>(){
                  @Override
                  public void accept(Integer o) {
                      System.out.println("Calling with Anonymous Inner Class "+o);
                  }
              });
          }
      }
      
    0 讨论(0)
提交回复
热议问题