Download presentation
Presentation is loading. Please wait.
Published byGinger Anderson Modified over 9 years ago
1
Mining Billions of AST Nodes to Study Actual and Potential Usage of Java Language Features Robert Dyer The research activities described in this talk were supported in part by the US National Science Foundation (NSF) grants CCF-13-49153, CCF-13-20578, TWC-12-23828, CCF-11-17937, CCF-10-17334, and CCF-10-18600. Tien N. Nguyen Hridesh Rajan Hoan Anh Nguyen
2
2 Previous Language Studies What languages do programmers choose? [Meyerovich&Rabkin SPLASH'13] Reflection [Livshits et al. APLAS'05] [Callaú et al. MSR'11] JavaScript / eval [Yue&Wang WWW'09] [Richards et al. PLDI'10] [Ratanaworabhan et al. WEBAPPS'10] [Richards et al. ECOOP'11] Generics [Basit et al. SEKE'05] [Parnin et al. MSR'11] [Hoppe&Hanenberg SPLASH'13] Object-oriented Features [Tempero et al. ECOOP'08] [Muschevici et al. OOPSLA'08] [Tempero ASWEC'09] [Grechanik et al. ESEM'10] [Gorschek et al. ICSE'10]
3
What is this study about? How have new Java language features been adopted over time? Assume Java Corpus of 30k+ projects Study 18 new features from 3 language editions Over 10 years of history
4
4 Research Questions RQ1: Are language features used before release? RQ2: How frequently is each feature used? RQ3: How did committers/teams adopt features? RQ4: Could features have been used more? RQ5: Was old code converted to use new features?
5
How is Java's language defined? Java Language Specifications (JLS)
6
6 JLS2Java 1.4May 2002 JLS3Java 5September 2004 JLS4Java 7July 2011 JLS5Java 8March 2014
7
7 JLS2: New Language Features Assert assert i > 0; assert n != null;
8
8 JLS3: New Language Features Enhanced-For Loop for (T v : items)... Annotation Declaration @interface Test {} Enums enum E { N1,..} Annotation Use @Test void m() Generic Variables List l; Map m; Varargs void m(T... arg){ Generic Types interface List {} Generic Methods void m(T a){ Generic Wildcards Class c;
9
9 JLS4: New Language Features Diamond Map m = new HashMap<>(); Binary Literals int ONE = 0b001; int TWO = 0b010; int FOUR = 0b100; Underscore Literals int MILLION = 1_000_000; int MASK = 0xFF_FF_00; Safe Varargs @SafeVarargs static List asList(T... elems) { Multi-catch try {.. } catch (E1 | E2 e) {.. } Try with Resources try (File f = new..) {
10
10 Study Tools and Dataset Boa [ICSE'13] http://boa.cs.iastate.edu/java-features/ input = project 1 input = project 2 input = project 3 input = project n...... Dataset Boa Program...... Assert Assert[631152000] = 5 Assert[631154020] = 12 Assert[631161103] = 14 Assert[631172392] = 18. Output Assert[631152000] << 1; 631152000, 1 Assert[631154020] << 1; 631152000, 1 631154020, 1 631152000, 1 631154020, 1 631161103, 1 Processes
11
11 Study Dataset Projects31,432 Revisions4,298,309 Java Files9,093,216 Java File Snapshots28,747,948 AST Nodes18,323,905,323
12
Research Question 1 Are language features used before release? Yes!
13
Research Question 2 How frequently was each language feature used?
14
14 Project Histogram: Annotation Use
15
15 Project Density: Annotation Use
16
16 Some features popular
17
17 Some features popular. Why?
18
18 Some features popular. Why? List ArrayList Map HashMap Set Collection Vector Class Iterator HashSet (confirms [Parnin et al. MSR'11])
19
Research Question 3 How did committers adopt features? Adoption by individuals, not teams (confirms [Parnin et al. MSR'11])
20
Research Question 4 Could features have been used more?
21
21 Opportunity: Assert void m(..) { if (cond) throw new IllegalArgumentException();... } void m(..) { assert cond;... } Find methods that throw IllegalArgumentException. Simpler Machine-checkable Easily disabled for production
22
22 Opportunity: Varargs void m(a1, a2, T[] a3) { void m(a1, a2, T... a3) { Find methods that take arrays as last argument. m(..,.., new T[] {t 1, t 2,..}) { m(..,.., t 1, t 2,..) {
23
23 Opportunity: Binary Literals int x = 1 << 5; Find where literal 1 is shifted left. short[] phases = { 0x7, 0xE, 0xD, 0xB }; short[] phases = { 0b0111, 0b1110, 0b1101, 0b1011 };
24
24 Opportunity: Underscore Literals int x = 1000000; int x = 1_000_000; Find integers with 7 or more digits and no underscores.
25
25 Opportunity: Diamond List l = new ArrayList (); List l = new ArrayList<>(); Instantiation of generics not using diamond.
26
26 Opportunity: MultiCatch try {.. } catch (T1 e) { b1 } catch (T2 e) { b1 } try {.. } catch (T1 | T2 e) { b1 } A try with multiple, identical catch blocks.
27
27 Opportunity: Try w/ Resources try {.. } finally { var.close(); } try (var =..) {.. } Try statements calling close() in the finally block.
28
28 AssertVarargs Binary Literals DiamondMultiCatch Try w/ Resources Underscore Literals Old 89K612K56K3.3M341K489K5.3M New 291K1.6M5K414K24K33K507K Millions of opportunities!
29
Potential Uses Projects 18.18%88.78%5.9%59.08%49.75%37.27%51.15% 29 Actual Uses AssertVarargs Binary Literals DiamondMultiCatch Try w/ Resources Underscore Literals Projects 12.72%15.43%0.02%0.4%0.27%0.21%0.02% Millions of opportunities!
30
30 Impact: Potential for bugs BufferedReader br =...; String s = br.readLine(); br.close(); try (BufferedReader br =...;) { String s = br.readLine(); } throw new IOException();
31
31 Impact: Potential for bugs 193,768 instances sampling shows 50% accuracy Mine for methods that: 1. declare they throw IOException 2. do not catch IOException in body 3. contain a call to close() public void close() throws IOException { f.close(); } try {... } finally { f1.close(); f2.close(); } try { sock.close(); rec.close(); } catch (Exception e) { }
32
Research Question 5 Was old code converted to use new features?
33
33 Detecting Conversions potential N uses N potential N+1 uses N+1 uses N < uses N+1 potential N > potential N+1 File.java (Revision N) File.java (Revision N+1)
34
34 Detected lots of conversions! manual, systematic sampling confirms 2602 conversions 13 not conversions AssertVarargsDiamondMultiCatch Try w/ Resources Underscore Literals Count 1802.1K8.5K1621542 Files 1051.6K3.8K125991 Projects 374887223171
35
35 Similar usage patterns AssertVarargsDiamondMultiCatch Try w/ Resources Underscor e Literals Count 1802.1K8.5K1621542 Files 1051.6K3.8K125991 Projects 374887223171 Old code converted to use new features Only few features see high use AssertVarargs Binary Literals DiamondMultiCatch Try w/ Resources Underscore Literals Old 89K612K56K3.3M341K489K5.3M New 291K1.6M5K414K24K33K507K All 380K2.2M61K3.7M365K522K5.8M Files 1.39%12.74%0.11%12.25%2.28%1.85%5.86% Projects 18.18%88.78%5.9%59.08%49.75%37.27%51.15% Despite (missed) potential for use Feature adoption by individuals To summarize...
36
36 Call to action!
37
37 Thank you! http://boa.cs.iastate.edu/java-features/
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.