Learning to Predict User-Defined Types

TypeScript is a widely adopted gradual typed language where developers can optionally type variables, functions, parameters and more. Probabilistic type inference approaches with ML (machine learning) work well especially for commonly occurring types such as boolean , number , and string . TypeScrip...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on software engineering Vol. 49; no. 4; pp. 1508 - 1522
Main Authors: Jesse, Kevin, Devanbu, Premkumar T., Sawant, Anand
Format: Journal Article
Language:English
Published: New York IEEE 01-04-2023
IEEE Computer Society
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:TypeScript is a widely adopted gradual typed language where developers can optionally type variables, functions, parameters and more. Probabilistic type inference approaches with ML (machine learning) work well especially for commonly occurring types such as boolean , number , and string . TypeScript permits a wide range of types including developer defined class names and type interfaces. These developer defined types, termed user-defined types , can be written within the realm of language naming conventions. The set of user-defined types is boundless and existing bounded type guessing approaches are an imperfect solution. Existing works either under perform in user-defined types or ignore user-defined types altogether. This work leverages a BERT-style pre-trained model, with multi-task learning objectives, to learn how to type user-defined classes and interfaces. Thus we present DiverseTyper , a solution that explores the diverse set of user-defined types by uniquely aligning classes and interfaces declarations to the places in which they are used. DiverseTyper surpasses all existing works including those that model user-defined types.
ISSN:0098-5589
1939-3520
DOI:10.1109/TSE.2022.3178945