derive4j / derive4j Goto Github PK

Java 8 annotation processor and framework for deriving algebraic data types constructors, pattern-matching, folds, optics and typeclasses.

Java 100.00%

algebraic-data-types optics functional-programming algebra visitor-pattern annotation-processor java-8 derive4j catamorphisms visitor

derive4j's Introduction

Derive4J: Java 8 annotation processor for deriving algebraic data types constructors, pattern matching and more!

tl;dr Show me how to write, say, the Either sum type with Derive4J!.

Example: a 'Visitor' for HTTP Request
Smart constructors
Static methods export
Updating deeply nested immutable data structure
Popular use-case: domain specific languages
Catamorphisms
Extensible algebraic data types
But what exactly is generated?
Parametric polymorphism
Generalized Algebraic Data Types
DRY annotation configuration
Use it in your project
Changelog
Contributing
Contact

Caution: if you are not familiar with Algebraic Data Types or the "visitor pattern" then you may want to learn a bit about them.

So, what can this project do for us, poor functional programmers stuck with a legacy language called Java? A good deal of what is commonly available in better languages like Haskell, including:

structural pattern matching on Algebraic data types, with compile-time exhaustiveness/redundancy check,
laziness (a value can be a memoized thunk),
automatic type class derivation
Generalised algebraic data types
combinators implementing lenses, prisms and optionals.

Algebraic data types come in two flavours, product types and sum types. This readme focus on sum types because it is the more interesting case; product types being the well known common case in Java, but Derive4J handles product types in exactly the same fashion (ie. through a visitor interface with a single abstract method).

Example: a 'Visitor' for HTTP Request

Let's say we want to model an HTTP request. For the sake of the example let's say that an HTTP request can either be

a GET on a given path
a DELETE on a given path
a POST of a content body on a given path
a PUT of a content body on a given path

and nothing else!

You could then use the corrected visitor pattern and write the following class in Java:

package org.derive4j.example;

/** A data type to model an http request. */
@Data
public abstract class Request {

  /** the Request 'visitor' interface, R being the return type
   *  used by the 'accept' method : */
  interface Cases<R> {
    // A request can either be a 'GET' (of a path):
    R GET(String path);
    // or a 'DELETE' (of a path):
    R DELETE(String path);
    // or a 'PUT' (on a path, with a body):
    R PUT(String path, String body);
    // or a 'POST' (on a path, with a body):
    R POST(String path, String body);
    // and nothing else!
  }

  // the 'accept' method of the visitor pattern:
  public abstract <R> R match(Cases<R> cases);

  /**
   * Alternatively and equivalently to the visitor pattern above, if you prefer a more FP style,
   * you can define a catamorphism instead. (see examples)
   * (most useful for standard data type like Option, Either, List...)
   */
}

Constructors

Without Derive4J, you would have to create subclasses of Request for all four cases. That is, write at the minimum something like:

  public static Request GET(String path) {
    return new Request() {
      @Override
      public <R> R match(Cases<R> cases) {
        return cases.GET(path);
      }
    };}

for each case. But thanks to the @Data annotation, Derive4j will do that for you! That is, it will generate a Requests class (the name is configurable, the class is generated by default in target/generated-sources/annotations when using Maven) with four static factory methods (what we call 'constructors' in FP):

  public static Request GET(String path) {...}
  public static Request DELETE(String path) {...}
  public static Request PUT(String path, String body) {...}
  public static Request POST(String path, String body) {...}

You can also ask Derive4J to generate null checks with:

@Data(arguments = ArgOption.checkedNotNull)

equals, hashCode, toString?

Derive4J philosophy is to be as safe and consistent as possible. That is why Object.{equals, hashCode, toString} are not implemented by generated classes by default (they are best kept ignored as they break parametricity). Nonetheless, as a concession to legacy, it is possible to force Derive4J to implement them, by declaring them abstract. Eg by adding the following in your annotated class:

  @Override
  public abstract int hashCode();
  @Override
  public abstract boolean equals(Object obj);
  @Override
  public abstract String toString();

The safer solution would be to never use those methods and use 'type classes' instead, eg. Equal, Hash and Show. The project Derive4J for Functional Java aims to generate them automatically.

Pattern matching syntaxes

Now let's say that you want a function that returns the body size of a Request. Without Derive4J you would write something like:

  static final Function<Request, Integer> getBodySize = request -> 
      request.match(new Cases<Integer>() {
        public Integer GET(String path) {
          return 0;
        }
        public Integer DELETE(String path) {
          return 0;
        }
        public Integer PUT(String path, String body) {
          return body.length();
        }
        public Integer POST(String path, String body) {
          return body.length();
        }
      });

With Derive4J you can do that a lot less verbosely, thanks to a generated fluent structural pattern matching syntaxes! And it does exhaustivity check! (you must handle all cases). The above can be rewritten into:

static final Function<Request, Integer> getBodySize = Requests.cases()
      .GET_(0) // shortcut for .Get(path -> 0)
      .DELETE_(0)
      .PUT((path, body)  -> body.length())
      .POST((path, body) -> body.length())

or even (because you don't care of GET and DELETE cases):

static final Function<Request, Integer> getBodySize = Requests.cases()
      .PUT((path, body)  -> body.length())
      .POST((path, body) -> body.length())
      .otherwise_(0)

Derive4j also allows to match directly against a value:

static int getBodyLength(Request request) {
  return Requests.caseOf(request)
      .PUT((path, body)  -> body.length())
      .POST((path, body) -> body.length())
      .otherwise_(0)
}

Accessors (getters)

Now, pattern matching every time you want to inspect an instance of Request is a bit tedious. For this reason Derive4J generates 'getter' static methods for all fields. For the path and body fields, Derive4J will generate the following methods in the Requests class:

  public static String getPath(Request request){
    return Requests.cases()
        .GET(path          -> path)
        .DELETE(path       -> path)
        .PUT((path, body)  -> path)
        .POST((path, body) -> path)
        .apply(request);
  }
  // return an Optional because the body is not present in the GET and DELETE cases:
  static Optional<String> getBody(Request request){
    return Requests.cases()
        .PUT((path, body)  -> body)
        .POST((path, body) -> body)
        .otherwiseEmpty()
        .apply(request);
  }

(Actually the generated code is equivalent but more efficient)

Using the generated getBody methods, we can rewrite our getBodySize function into:

static final Function<Request, Integer> getBodySize = request ->
      Requests.getBody(request)
              .map(String::length)
              .orElse(0);

Functional setters ('withers')

The most painful part of immutable data structures (like the one generated by Derive4J) is updating them. Scala case classes have copy methods for that. Derive4J generates similar modifier and setter methods in the Requests class:

  public static Function<Request, Request> setPath(String newPath){
    return Requests.cases()
            .GET(path          -> Requests.GET(newPath))
            .DELETE(path       -> Requests.DELETE(newPath))
            .PUT((path, body)  -> Requests.PUT(newPath, body))
            .POST((path, body) -> Requests.POST(newPath, body)));
  }
  public static Function<Request, Request> modPath(Function<String, String> pathMapper){
    return Requests.cases()
            .GET(path          -> Requests.GET(pathMapper.apply(path)))
            .DELETE(path       -> Requests.DELETE(pathMapper.apply(path)))
            .PUT((path, body)  -> Requests.PUT(pathMapper.apply(path), body))
            .POST((path, body) -> Requests.POST(pathMapper.apply(path), body)));
  }
  public static Function<Request, Request> setBody(String newBody){
    return Requests.cases()
            .GET(path          -> Requests.GET(path))    // identity function for GET
            .DELETE(path       -> Requests.DELETE(path)) // and DELETE cases.
            .PUT((path, body)  -> Requests.PUT(path, newBody))
            .POST((path, body) -> Requests.POST(path, newBody)));
  }
  ...

By returning a function, modifiers and setters allow for a lightweight syntax when updating deeply nested immutable data structures.

First class laziness

Languages like Haskell provide laziness by default, which simplifies a lot of algorithms. In traditional Java you would have to declare a method argument as Supplier<Request> (and do memoization) to emulate laziness. With Derive4J that is no more necessary as it generates a lazy constructor that gives you transparent lazy evaluation for all consumers of your data type:

  // the requestExpression will be lazy-evaluated on the first call
  // to the 'match' method of the returned Request instance:
  public static Request lazy(Supplier<Request> requestExpression) {
    ...
  }

Have a look at List for how to implement a lazy cons list in Java using Derive4J (you may also want to see the associated generated code).

Flavours

In the example above, we have used the default JDK flavour. Also available are FJ (Functional Java), Fugue (Fugue), Javaslang/Vavr (Vavr), HighJ (HighJ), Guava and Cyclops (Cyclops-react) flavours. When using those alternative flavours, Derive4J will use eg. the specific Option implementations from those projects instead of the jdk Optional class.

Optics (functional lenses)

If you are not familiar with optics, have a look at Monocle (for Scala, but Functional Java provides similar abstraction).

Using Derive4J generated code, defining optics is a breeze (you need to use the FJ flavour by specifying @Data(flavour = Flavour.FJ):

  /**
   * Lenses: optics focused on a field present for all data type constructors
   * (getter cannot 'failed'):
   */
  public static final Lens<Request, String> _path = lens(
      Requests::getPath,
      Requests::setPath);
  /**
   * Optional: optics focused on a field that may not be present for all constructors
   * (getter return an 'Option'):
   */
  public static final Optional<Request, String> _body = optional(
      Requests::getBody,
      Requests::setBody);
  /**
   * Prism: optics focused on a specific constructor:
   */
  public static final Prism<Request, String> _GET = prism(
      // Getter function
      Requests.cases()
          .GET(fj.data.Option::some)
          .otherwise(Option::none),
      // Reverse Get function (aka constructor)
      Requests::GET);

  // If there is more than one field, we use a tuple as the prism target:
  public static final Prism<Request, P2<String, String>> _POST = prism(
      // Getter:
      Requests.cases()
          .POST((path, body) -> p(path, body))
          .otherwiseNone(),
      // reverse get (construct a POST request given a P2<String, String>):
      p2 -> Requests.POST(p2._1(), p2._2()));
}

Smart constructors

Sometimes you want to validate the constructors parameters before returning an instance of a type. When using the Smart visibity (@Data(@Derive(withVisibility = Visibility.Smart))), Derive4J will not expose "raw" constructors and setter as public, but will use package private visibility for those methods instead (getters will still be public).

Then you expose a public static factory method that will do the necessary validation of the arguments before returning an instance (typically wrapped in a Option/Either/Validation), and that public factory will be the only way to get an instance of that type.

See usage of this feature in PersonName.

Static methods export

It is generally considered good style to keep static methods and instance methods separated, especially because overloads could cause ambiguities on usage as method references.

The Java file generated by derive4j contains only static methods, so it makes sense to use this class as main entry point for the static part of the data type API.

To this end, Derive4J support re-exporting of your own manually-written static methods as part of the generated class API. It can do so in two ways (that can be combined):

by specifying that the main generated class must extends a given class eg. MyStaticMethods.class, thus exposing all its static methods through inheritance.
by annotating your package-private static methods with @ExportAsPublic: Derive4J will generate public forwarding methods in the generated class, and, as bonus, it will memoize the result of nullary methods.

@Data(@Derive(extend = MyStaticMethods.class))
public abstract class List<A> {
  // package-private static class with public static methods:
  static abstract class MyStaticMethods {
    public static <A> List<A> singleton(A a) {
      return Lists.cons(a, Lists.nil())
    }
  }
  // Or use the annotation, either in the above MyStaticMethods class
  // or directly in the data type class:
  @ExportAsPublic
  static <A> List<A> singleton(A a) {
    return Lists.cons(a, Lists.nil())
  }
}

public static void main(final String[] args) {
  // enjoy single access points for all static methods:
  List<String> a = Lists.singleton("a");
}

See usage of this feature in PersonName.

Updating deeply nested immutable data structure

Let's say you want to model a CRM. Each client is a Person who can be contacted by email, by telephone or by postal mail. With Derive4J you could write the following:

import org.derive4j.*;
import java.util.function.BiFunction;

@Data
public abstract class Address {
  public abstract <R> R match(@FieldNames({"number", "street"}) 
  			      BiFunction<Integer, String, R> Address);
}

import org.derive4j.Data;

@Data
public abstract class Contact {
    interface Cases<R> {
      R byEmail(String email);
      R byPhone(String phoneNumber);
      R byMail(Address postalAddress);
    }
    public abstract <R> R match(Cases<R> cases);
}

import org.derive4j.*;
import java.util.function.BiFunction;

@Data
public abstract class Person {
  public abstract <R> R match(@FieldNames({"name", "contact"})
                              BiFunction<String, Contact, R> Person);
}

But now we have a problem: All the clients have been imported from a legacy database with an off-by-one error for the street number! We must create a function that increments each Person's street number (if it exists) by one. And we have to do this without modifying the original data structure (because it is immutable). With Derive4J, writing such a function is trivial:

import java.util.Optional;
import java.util.function.Function;

import static org.derive4j.example.Addresss.Address;
import static org.derive4j.example.Addresss.getNumber;
import static org.derive4j.example.Addresss.modNumber;
import static org.derive4j.example.Contacts.getPostalAddress;
import static org.derive4j.example.Contacts.modPostalAddress;
import static org.derive4j.example.Persons.Person;
import static org.derive4j.example.Persons.getContact;
import static org.derive4j.example.Persons.modContact;

  public static void main(String[] args) {

    Person joe = Person("Joe", Contacts.byMail(Address(10, "Main St")));

    Function<Person, Person> incrementStreetNumber = modContact(
    						       modPostalAddress(
    						         modNumber(number -> number + 1)));
    
    // correctedJoe is a copy of joe with the street number incremented:
    Person correctedJoe = incrementStreetNumber.apply(joe);

    Optional<Integer> newStreetNumber = getPostalAddress(getContact(correctedJoe))
        .map(postalAddress -> getNumber(postalAddress));

    System.out.println(newStreetNumber); // print "Optional[11]" !!
  }

Popular use-case: domain specific languages

Algebraic data types are particulary well fitted for creating DSLs. A calculator for arithmetic expressions could be built like this:

import java.util.function.Function;
import org.derive4j.Data;
import static org.derive4j.example.Expressions.*;

@Data
public abstract class Expression {

	interface Cases<R> {
		R Const(Integer value);
		R Add(Expression left, Expression right);
		R Mult(Expression left, Expression right);
		R Neg(Expression expr);
	}
	
	public abstract <R> R match(Cases<R> cases);

	private static Function<Expression, Integer> eval = Expressions
		.cases()
			.Const(value        -> value)
			.Add((left, right)  -> eval(left) + eval(right))
			.Mult((left, right) -> eval(left) * eval(right))
			.Neg(expr           -> -eval(expr));

	public static Integer eval(Expression expression) {
		return eval.apply(expression);
	}

	public static void main(String[] args) {
		Expression expr = Add(Const(1), Mult(Const(2), Mult(Const(3), Const(3))));
		System.out.println(eval(expr)); // (1+(2*(3*3))) = 19
	}
}

Catamorphisms

are generated for recursively defined datatypes. So that you can rewrite the above eval method into:

	public static Integer eval(Expression expression) {
		Expressions
		     .cata(
		        value -> value,
		        (left, right) -> left + right,
		        (left, right) -> left * right,
		        expr -> -expr,
		        Supplier::get
		     )
		     .apply(expression)
	}

The last parameter (Supplier::get above) specify how recursive calls are suspended. Using Supplier::get means that the computation is not suspended: for deep structures it may blow the stack!

To be safe, use the lazy, (or delay or suspend or defer...) constructor of your result type, such as the lazy constructor generated by Derive4J.

If no such constructor is available then your safe option is to use a Trampoline, such as the one provided by FunctionalJava:

public static Integer stackSafeEval(Expression expression) {
    Expressions.cata(
        value -> Trampoline.pure(value),
        (left, right) -> left.zipWith(right, (l, r) -> l + r),
        (left, right) -> left.zipWith(right, (l, r) -> l * r),
        expr -> expr.map(i -> -i),
        Trampoline::suspend
    ).f(expression).run();
}

Extensible algebraic data types

Algebraic data types defined as fix-point (aka initial algebra) of an object algebras can enjoy their extensibility properties.

When the data type is not inductive the extensibility property comes directly from covariance

Eg. an event type for an inventory service:

  @Data
  interface EventV1 {

    interface Cases<R> {
      R newItem(Long ref, String itemName);
      R itemRemoved(Long ref);
    }

    <R> R match(Cases<R> cases);
  }

Then comes a new version of the service, with enriched events and new cases. If the visitor for the new event type extend the old visitor interface then old events can be easily converted to new events, without change to the old classes:

  @Data
  interface EventV2 {

    interface Cases<R> extends EventV1.Cases<R> { // extends V1 with:

      // new `initialStock` field in `newItem` event:
      R newItem(Long ref, String itemName, int initialStock);
      // default to 0 for old events:
      @Override
      default R newItem(Long ref, String itemName) {
        return newItem(ref, itemName, 0);
      }
      // new event:
      R itemRenamed(Long ref, String newName);
    }

    <R> R match(Cases<R> cases);

    static EventV2 fromV1(EventV1 v1Event) {
      // Events are (polymorphic) functions!
      // And functions are contra-variant in type argument,
      // thus we can use method reference to convert from V1 to V2:
      return v1Event::match;
    }
  }

Extensible inductive data types via hylomorphisms

Aka solving the expression problem via object-algebras used as visitor. For this, we need to slightly change the visitor of the above Expression so that a type variable (E) is used instead of the self-reference:

@Data
interface Exp {

  interface ExpAlg<E, R> {
    R Lit(int lit);
    R Add(E e1, E e2);
  }

  <R> R accept(ExpAlg<Exp, R> alg);
}

When data types are defined is such a way (as a fix-point of the algebra), Derive4J generate (by default) an instance of the visitor/algebra that can serve as factory (aka. anamorphism). Using this factory as an argument to compatible catamorphism (thus creating a hylomorphism) we obtain a conversion function from one ADT to another.

Eg. we can create a new data type that add a multiplication case to the above data type, and still be able to maximally reuse the existing code without modification:

@Data
interface ExpMul {

  interface ExpMulAlg<E, R> extends Exp.ExpAlg<E, R> {
    R Mul(E e1, E e2);
  }

  <R> R accept(ExpMulAlg<ExpMul, R> alg);

  static Function<Exp, ExpMul> fromExp() {
    ExpMulAlg<ExpMul, ExpMul> factory = ExpMuls.factory();
    return Exps.cata(factory, ExpMuls::lazy);
  }
}

To ensure smooth extensibility across compilation unit (or even during incremental compilation), it is best to use the -parameters option of javac.

But what exactly is generated?

This is a very legitimate question. Here is the ExpMuls.java file that is generated for the above @Data ExpMul type.

Parametric polymorphism

... works as expected. For example, you can write the following:

import java.util.function.Function;
import java.util.function.Supplier;
import org.derive4j.Data;

@Data
public abstract class Option<A> {

    public abstract <R> R cata(Supplier<R> none, Function<A, R> some);

    public final <B> Option<B> map(final Function<A, B> mapper) {
        return Options.modSome(mapper).apply(this);
    }
}

The generated modifier method modSome allows polymorphic update and is incidentaly the functor for our Option!

Generalized Algebraic Data Types

GADTs are also supported out of the box by Derive4J (within the limitations of the Java type system). Here is how you can translate the example from Fun with phantom types:

import org.derive4j.hkt.TypeEq;

@Data
public abstract class Term<T> {
  interface Cases<A, R> {
    R Zero(TypeEq<Integer, A> id);
    R Succ(Term<Integer> pred, TypeEq<Integer, A> id);
    R Pred(Term<Integer> succ, TypeEq<Integer, A> id);
    R IsZero(Term<Integer> a, TypeEq<Boolean, A> id);
    R If(Term<Boolean> cond, Term<A> then, Term<A> otherwise);
  }

  public abstract <X> X match(Cases<T, X> cases);

  public static <T> T eval(final Term<T> term) {

    return Terms.caseOf(term).
        Zero(id -> id.coerce(0)).
        Succ((t, id) -> id.coerce(eval(t) + 1)).
        Pred((t, id) -> id.coerce(eval(t) - 1)).
        IsZero((t, id) -> id.coerce(eval(t) == 0)).
        If((cond, then, otherwise) -> eval(cond)
            ? eval(then)
            : eval(otherwise));
  }

  public static void main(final String[] args) {

    Term<Integer> one = Succ(Zero());
    out.println(eval(one)); // "1"
    out.println(eval(IsZero(one))); // "false"
    // IsZero(IsZero(one)); // does not compile:
    // "The method IsZero(Term<Integer>) in the type Term<T> is not
    // applicable for the arguments (Term<Boolean>)"
    out.println(eval(If(IsZero(one), Zero(), one))); // "1"
    Term<Boolean> True = IsZero(Zero());
    Term<Boolean> False = IsZero(one);
    out.println(eval(If(True, True, False))); // "true"
    // out.println(prettyPrint(If(True, True, False), 0)); // "if IsZero(0)
    //  then IsZero(0)
    //  else IsZero(Succ(0))"
  }
}

For GADT you will need to add a dependency on derive4j/hkt which provides TypeEq<A, B>: a witness of the equality of two types, A and B.

DRY annotation configuration

By default the @Data annotation triggers the generation of everything which is available, in a file whose name is the English plural of the annotated class. But you may want to restrict the scope of what is generated, or change the name of the file, and you usually want all you ADTs to use the same flavour. You may even dislike the name of the annotation because it clashes with another framework...

For example, let's say that you want to always use the FJ flavour (FunctionalJava), make the generated code package private in a class suffixed by Impl and only generate the pattern matching syntax and the constructors. Then all you have to do is to create the following annotation:

@Data(flavour = Flavour.FJ, value = @Derive(
    inClass = "{ClassName}Impl",
    withVisibility = Visibility.Package,
    make = { Make.constructors, Make.caseOfMatching }
))
public @interface myADT {}

And you annotate your classes with @myADT instead of @Data, saving on that configuration every time.

But now for some of your ADTs you may want to also generate getters and functional setters. In order to not lose the benefits of your @myADT, derive4j allows you to do this:

@myADT
@Derive(make = { Make.getters, Make.modifiers }) // add-up to the @myADT configuration
public abstract class Adt {...}

Use it in your project

Derive4J should be declared as a compile-time only dependency (not needed at runtime). So while derive4j is (L)GPL-licensed, the generated code is not linked to derive4j, and thus derive4j can be used in any project (proprietary or not).

Maven:

<dependency>
  <groupId>org.derive4j</groupId>
  <artifactId>derive4j</artifactId>
  <version>1.1.1</version>
  <optional>true</optional>
</dependency>

Gradle

compile(group: 'org.derive4j', name: 'derive4j', version: '1.1.1', ext: 'jar')

or better using the gradle-apt-plugin:

compileOnly "org.derive4j:derive4j-annotation:1.1.1"
apt "org.derive4j:derive4j:1.1.1"

Contributing

Bug reports and feature requests are welcome, as well as contributions to improve documentation.

Right now the codebase is not ready for external contribution (many blocks of code are more complicated than they should be). So you might be better off waiting for the resolution of #2 before trying to dig into the codebase.

Contact

[email protected], @jb9i or use the project GitHub issues.

Thanks

This project has a special dedication to Tony Morris' blog post Debut with a catamorphism. I'm also very thankful to @sviperll and his adt4j project which was the initial inspiration for Derive4J.

derive4j's People

Contributors

Stargazers

Watchers

derive4j's Issues

Add support for a HighJ flavour

https://github.com/DanielGronau/highj

GADTs : provide an 'Id' functional interface for type association (unification ?)

Jamais deux sans trois,

Re-reading my example :

abstract class Seminar<T> {
    Seminar() {}
    interface Cases<R, T> {
        R Init(SData.Core core, F<Init, T> __);
        R Temp(SData.Core core, SData.Add1 add1, F<Temp, T> __);
    }

  public static final class Init { private Init() {} }
  public static final class Temp { private Temp() {} }
}

I realize that the free F type (any functional interface would do, right ?) used to associate the T type parameter with a concrete (Init or Temp) type could be profitably (for the newcomer) replaced by a fixed Id type provided by derive4j.

What do you think ? Wouldn't it make the intent of that mysterious additional parameter clearer to the boeotian (that I once was) ?

Alternative lazy value implementations

Additionally from the current synchronized-based implementation, two alternative implementations should be available and configurable via annotation:

one based on AtomicReference<Adt>
one based on AtomicReference<WeakReference<Adt>>

which allow better throughput than the synchronized based implementation at the price of possible concurrent evaluation.

Add support for JavaSlang 2.0 as a Flavour

After playing with Derive4J for even just an evening, I think having support for JavaSlang from @danieldietrich would be a great addition once the 2.0 release it made.

This would be a natural fit alongside the other flavours - reusing JavaSlangs Option and Function2. Adding generators for tryEmailAddress along side getEmailAddress when using JavaSlang flavour may also be a potentially nice addition.

Add support for a la carte derivation

Last week I was experimenting with reworking some existing $work code ( using jADT ) to use derive4j and whilst it worked well, and the code translated fairly seamlessly, I started noticed weird parse/completion errors in IntelliJ.

It seems IntelliJ by default stops parsing files if there over 2500 lines long, and the derive4j generated source file in question was something like 60,000 lines.

In this particular instance, once an object has been created, we have no desired to have mutations, so don't really need any of the set/mod support functions to get generated ( also saving a lot of lambdas being compiled ).

As a simple solution ( other than increasing the rather hidden setting for IntelliJ completion ) was to have a new Flavour.None for derive4j that prevented the generation of get/set/mod methods, leaving you only with the constructors.

Would something like this be useful to anyone else? Or would a more generic (disable set, disable mod, disable get) generation control across flavours be more useful?

Generate narrow/coerce method for higher kinded types

For class that implements https://github.com/derive4j/hkt/blob/master/src/main/java/org/derive4j/hkt/__.java

Now I'm not sure of what the method name should be... (configurable?)

Typo: @Derive.withVisbility => withVisibility

(and also rename module 'exemple' => 'example')

explicit error message for @Data on non static class

This is a common oversight when creating inner class. See #14.

Make the @Data annotation available for package (package-info.java)

Components of the annotation would then be overridable:
package and class annotated deeper in the package hierarchy override the configuration found in annotated packages closer to the root.

This will alleviate the need to repeat the same @Data configuration over and over on each data type in a given project.
Also the name of the generated class will support template, ie something like @Derive(inClass="{ClassName}Impl").

GADT : total getters for specific types

Hi Jean-Baptiste,

Let's say we have the following GADT :

@data
interface Seminar<T> {
  interface Cases<R, T> {
    R Init(SData.Core core, TypeEq<Init, T> __);
    R Temp(SData.Core core, SData.Add1 add1, TypeEq<Temp, T> __);
  }
  <R> R match(Cases<R, T> cases);

  enum Init {}
  enum Temp {}
}

then the following function

// getter not returning an Option<SData.Add1>
public static SData.Add1 getAdd1(Seminar<Temp> temp) {
  return Seminars.getAdd1(temp).some(); // cannot fail since we have a specific Seminar<Temp>
}

has to be written by hand, as you can see.

Would it be possible, in the case of GADTs, to generate total getters for specific cases ?

Manage annotated inner classes

Given these classes :

@Data(flavour = Flavour.FJ)
public abstract class Product<T> {
  // all the stuff needed...

  // ...and then
  @Data(flavour = Flavour.FJ)
  public abstract class Seminar<T> {
     // all the stuff needed...
  } // end of Seminar

} // end of Product

only the Products class gets generated. It would be nice if annotated inner classes could be parsed and get their utilities generated just as top level ones.

Derive catamorphism for recursively defined data types

Ability to add annotations to generated classes

It would often be nice to have sensible ToString implementations on the generated classes. One way of providing toString() is to use lombok's @tostring annotation on the generated class. This could be enabled by being able to specify which annotations should apply to the generated classes.

Do not generate mapper interface for no-arg/one-arg constructors

Use the flavour F0/Supplier or F/Function interfaces instead. This will save some classes and facilitate composition.

Add intermediate visibility to hide constructors and setters

You need smart constructors for your data type when the constraints of your model cannot be made practical by the type system. Eg. you need to validate at runtime the input needed to construct the data type with constructors of the form input -> Option ADT instead of normal input -> ADT constructors generated by derive4J.
If that happens then you cannot expose constructors and setters generated by Derive4J (all data constructions must go through the smart constructors).

To support this idiom, we need a Visbility.Smart (better name?) that will only expose getters and pattern matching as public (rather, 'same' visibiliy) and generate constructors and setters method with visbility 'package'.

Support subtyping of visitor

ie. derive4j must look for not implemented abstract method also in the interfaces and/or super class, recursively.
This allow to use techniques described in http://ropas.snu.ac.kr/~bruno/papers/ecoop2012.pdf

Potential identifers clash in generated code

There is some possibilities that the generated classes does not compile due to clash of identifers (eg. identifier originating from user code clash with hard-coded identifier in generated code).
This should not happen in most cases, but we should ensure hygienism of all generated code.
Will be facilitated by square/javapoet#338
Should go hand in hand with #2.

Publish to maven central

That will probably be my greatest achievement!

Export static package-private methods annotated with @Export as public in generated class

This allows to use exclusively the generated class for all static api.
This should also addresses #41.

Also, as bonus, all static no-args methods will be exported with an additional caching of the returned value.
/cc @talios: WDYT?

Generation fails for sum type + "lambda visitor" + non default Flavour

see report in datatyped/pfds-java@3487a21#commitcomment-16018772 by @rdegnan

More descriptive name than `@Data`

What a nice processor!

I would ask you use a more descriptive name than @Data for annotation:

"Data" is such a generic word, difficult to scan code with @Data and grok something interesting is happening.
"Data" may clash with user code or other packages which also have a Data class, especially after imports.
In particular, Lombok is probably the most popular 3rd-party annotation processor, and it, too, has a Data class, which does something rather different.
This package is version 0.9.1 (as of when I wrote this), so you should have some freedom in refactoring before hitting the magic 1.0.0 and your public API is frozen (assuming semantic versioning).

You have many fine alternative choices for naming the annotation. First to my mind is @ADT. A web search on "ADT" turns up near the top the right Wikipedia page to understand some background on this library's goals.

Cheers,
--binkley

Eclipse (ecj) support

When compiling with ecj, total pattern matching syntax does not follow definition order in source file.
Also some generated code does not compile due poor type inference.

Additionnal facilities for partial matching

today there is only

Function<ADT, R> otherwise(Supplier<R> expression)

I think it should be renamed into

Function<ADT, R> otherwiseEval(Supplier<R> expression)

And then we can add:

Function<ADT, R> otherwise(R value)
Function<ADT, Option<R>> otherwiseNone()
Function<ADT, Either<L, R>> <L> otherwiseLeft(L leftValue)
Function<ADT, Validation<E, R>> <E> otherwiseFail(E error)

What others think about that feature? (are naming good?)

Auto-generate a Step Builder?

Can be handy if a data constructor has a lot of fields, and you want to insure the user of your data structure supplies the fields in the correct order. (avoid mixing up fields with the same type, like width and height)

Readme.md code typo

First code showcase:

public abstract <R> R match(Cases<X> cases);

Should probably be Cases<R>

Support for optional nullchecks in constructors

Although not necessary if you never use null (compiler plugin, anyone?), a mechanism ensuring that null never slip into a data-structure is expected by some user.
Null checks in generated code will be triggered by:
@Data(arguments = checkedNotNull)

Compile fails when using a field of primitive type

when the field is not present for all constructors.

Unable to use primitives in values

I changed one of my objects to include a boolean and got this:

[INFO] Compiling 9 source files to /Users/amrk/IdeaProjects/upstream/halbuilder/halbuilder-core/target/classes
java.lang.IllegalArgumentException: boolean
    at com.sun.tools.javac.model.JavacTypes.getDeclaredType0(JavacTypes.java:252)
    at com.sun.tools.javac.model.JavacTypes.getDeclaredType(JavacTypes.java:222)
    at org.derive4j.processor.DeriveUtilsImpl.resolve(DeriveUtilsImpl.java:137)
    at org.derive4j.processor.derivator.GettersDerivator.visitorDispatchLensGetterImpl(GettersDerivator.java:197)
    at org.derive4j.processor.derivator.GettersDerivator.lambda$generateLensGetter$10(GettersDerivator.java:173)

I guess derive4j doesn't support primitives? Should it? Or should gracefully fail, I switched to Boolean for now anyway.

Generated equals method wrongly use reference equality for type variables

Fields whose type correspond to a type variable are (wrongly) compared using reference equality instead of Object#equals.

Avoid overloading in pattern matching synthax.

Since the time I closed #8, I encountered quite a few place where the overload was ambiguous (at least for javac) and had to use explicit type annotation to fix them. This was annoying and I'm thinking of avoiding overloading for the next release.

This means adding the suffix _ to the overload that take a plain (constant) value as argument.

This suffix is consistent with the syntax in haskell/scala where _ is a place holder for an ignored value.

Add Guava flavor

for projects stuck on Java 7

Provide a means for supplying a validator

I was thinking it would be handy to have some means of providing a validator over the ADT values, I often use this feature of Immutables and keep thinking it would be useful here.

Currently, derive4j checks for nullness in its created instances, but it would be nice if we could define something like:

protected|public void validate() {
  this.match(....)
}

protected|public static Predicate<MyAdtType> validator = it -> ....;

which if existed, gets called before returning the constructed value.

Thoughts?

Add support for varargs in datatype definitions

Would be it possible to add support for varargs:

interface Cases<R> {
  R develop(Job job, Job... dependencies);
  R review(Job job, Job... dependencies);
}

generators constructors like:

public static JobType develop(Job job, Job[] dependencies) {
...

It would be handy if the constructor method matched the vararg definition.

Existential (phantom ?) type parameters on constructors

Hey, me again,

the following haskell idiom seems currently not expressible using derive4j :

data Stream a = forall s. Stream (s -> Step a s) s

Would it be possible to allow that kind of construct :

@Derive4J(flavour = Flavour.FJ)
abstract class Stream<A> {
  interface Cases<R, A, S> {
    R Stream(Unlifted<S> s, F<S, Step<A, S>> stepper, S init);
  }
  abstract <R, S> R match(Cases<R, A, S> cases);
}

or better

@Derive4J(flavour = Flavour.FJ)
abstract class Stream<A> {
  interface Cases<R, A> {
    <S> R Stream(Unlifted<S> s, F<S, Step<A, S>> stepper, S init);
  }
  abstract <R> R match(Cases<R, A> cases);
}

What do you think ?

Catamorphism generated for GADT are not "Generalized"

Make them so might be hard...
http://stackoverflow.com/questions/17503131/how-to-make-catamorphisms-work-with-parameterized-indexed-types

Generation fail if a same type variable is used multiple time in a given field.

The following:

@Data(value = @Derive(inClass = "Draw2DFImpl", withVisibility = Visibility.Package), flavour = Flavour.FJ)
public abstract class Draw2DF<M,A> implements __2<Draw2DF.Mu,M,A> {
    public static class Mu {}

    public static <M,A> Draw2DF<M,A> narrow(__<__<Mu,M>,A> a) {
        return (Draw2DF<M,A>)a;
    }

    public interface Cases<R,M,A> {
        R GetTextSize(String text, F<V2<Double>,A> textSizeK);
        R GetTextXWrappedHeight(String text, Double width, F<Double,A> textHeightK);
        R WithScale(Double scale, FreeT<__<Mu,M>,M,A> drawing);
        R DrawTextXWrapped(String text, Double width, A next);
        R DrawText(Vector2D pos, String text, Double alignmentX, Double alignmentY, A next);
        R Draw(Shape2D shape, A next);
        R Fill(Shape2D shape, A next);
        R GetImageSize(Draw2D2.ImageSource imgSrc, F<V2<Double>,A> imageSizeK);
        R DrawImage(Draw2D2.ImageSource imgSrc, Vector2D pos, Double alignmentX, Double alignmentY, A next);
        R DrawImageScaled(Draw2D2.ImageSource imgSrc, Vector2D pos, Double width, Double height, Double alignmentX, Double alignmentY, A next);
        R GetCurrentFont(F<Draw2D2.Font,A> fontK);
        R WithFont(Draw2D2.Font font, FreeT<__<Mu,M>,M,A> drawing);
        R UseColour(Draw2D.Colour colour, FreeT<__<Mu,M>,M,A> drawing);
        R InSpace(Axes2D axes, FreeT<__<Mu,M>,M,A> drawing);
    }

    public abstract <R> R match(Cases<R,M,A> cases);
}

Has the error: Error:java: com.sm.fp.graphics.Draw2DF: tag M cannot be used for both 'M' and 'M_'

But when wrapped into a new type (Drawing), the following this works fine:

@Data(value = @Derive(inClass = "Draw2DFImpl", withVisibility = Visibility.Package), flavour = Flavour.FJ)
public abstract class Draw2DF<M,A> implements __2<Draw2DF.Mu,M,A> {
    public static class Mu {}

    public static <M,A> Draw2DF<M,A> narrow(__<__<Mu,M>,A> a) {
        return (Draw2DF<M,A>)a;
    }

    public interface Cases<R,M,A> {
        R GetTextSize(String text, F<V2<Double>,A> textSizeK);
        R GetTextXWrappedHeight(String text, Double width, F<Double,A> textHeightK);
        R WithScale(Double scale, Drawing<M,A> drawing);
        R DrawTextXWrapped(String text, Double width, A next);
        R DrawText(Vector2D pos, String text, Double alignmentX, Double alignmentY, A next);
        R Draw(Shape2D shape, A next);
        R Fill(Shape2D shape, A next);
        R GetImageSize(Draw2D2.ImageSource imgSrc, F<V2<Double>,A> imageSizeK);
        R DrawImage(Draw2D2.ImageSource imgSrc, Vector2D pos, Double alignmentX, Double alignmentY, A next);
        R DrawImageScaled(Draw2D2.ImageSource imgSrc, Vector2D pos, Double width, Double height, Double alignmentX, Double alignmentY, A next);
        R GetCurrentFont(F<Draw2D2.Font,A> fontK);
        R WithFont(Draw2D2.Font font, Drawing<M,A> drawing);
        R UseColour(Draw2D.Colour colour, Drawing<M,A> drawing);
        R InSpace(Axes2D axes, Drawing<M,A> drawing);
    }

    public abstract <R> R match(Cases<R,M,A> cases);

    public static class Drawing<M,A> {
        private final FreeT<__<Mu,M>,M,A> toFreeT;

        private Drawing(FreeT<__<Mu,M>,M,A> toFreeT) {
            this.toFreeT = toFreeT;
        }

        public static <M,A> Drawing<M,A> drawing(FreeT<__<Mu,M>,M,A> toFreeT) {
            return new Drawing<>(toFreeT);
        }
    }
}

Allow @Data on enums

It would be nice to be able to pattern match on enum. If derive4j detect that the annotated class is an enum then it should skip constructors generation.

Auto-generate Jackson (de)serializers

It would be nice to automatically generate Jackson serialization and deserialization code for Derive4J defined ADTs. The encoding seems fairly straightforward: for any subclass, serialize its attributes int a JSON object and additionally add a "type" field that is a string encoding of the variant name (i.e. the tag).

There are probably some corner cases, like if a subclass has an attribute named "type". So there may need to be an option to define the name of the field that holds the tag. The default could be "type".

This of course should be optional (opt-in), so users who don't want this code generated are not affected.

Possible ExceptionInInitializerError with static fields

Given:

@Data
abstract class Nat {
  interface Cases<R> {
    R Zero();
    R Suc(Nat nat);
  }
  public abstract <R> R match(Cases<R> cases);

  private static Function<Nat, Integer> eval =
     Nats.cases().Zero( ()        -> 0)
        .Suc(nat  -> eval(nat) + 1);

  public static Integer eval(Nat nat) {
    return eval.apply(nat);
  }
}

When executing

public class Main {
  public static void main(String[] args) {
    Nat nat = Suc(Suc(Suc(Suc(Zero()))));
    System.out.println(eval(nat));
  }
}

You get

Exception in thread "main" java.lang.ExceptionInInitializerError
    at org.jomaveger.lambda.adt.Nats.<clinit>(Nats.java:10)
    at org.jomaveger.lambda.main.Main.main(Main.java:15)
Caused by: java.lang.NullPointerException
    at org.jomaveger.lambda.adt.Nat.<clinit>(Nat.java:21)
    ... 2 more

Due to:

the Zero constructor being eagerly cached in a static final field in the generated class.
a static field in Nat make use (indirectly) of another static field of the generated class.
This cylcle in the initialization dependencies trigger a nullpointer exception.

To avoid this, the Zero constructor must not be eagerly initialized.

Compilation of product types fails

If there is only one constructor, generated code fails to compile.

Enum like facilities

For a given ADT, EnumAdt, where all constructors take 0 parameters, then derive4j should derive:

public static List<EnumAdt> values();
public static Option<EnumAdt> forConstructor(String);
public static String constructorName(EnumAdt);
public static int constructorOrdinal(EnumAdt);

The last two could be generated for all ADT (can be useful for serialization).

Better naming suggestions are welcomed!

Code cleanup / doc / refactoring

Apply DRY seriously: code base has too much duplicated code blocks.
Add missing doc comments for not trivial stuff.

Question: please clarify equals/hashCode/toString philosophy as stated in README

The README says:

Derive4J philosophy is to be as safe and consistent as possible. That is why Object.{equals, hashCode, toString} are not implemented by generated classes by default. Nonetheless, as a concession to legacy.

If you have a minute, can you please clarify what this means? Specifically, why is not having these methods "safe and consistent".

Thank you!

Smarter default name for generated classes.

Currently, by default, the generated class is the name of the class + 's'.
This issue is about making the default name to better follow English grammar for plural:

if the class ends with 'y' -> the 'y' is substituted with 'ies' for the generated class
if the class ends with s, x, ch, sh, or z then we add 'es' for the generated class.

There is also lots of other special cases but not sure they are worth it.

Additional "caseOf" pattern matching synthax

This idea is to support the following pattern matching scheme:

Request r = ...
int bodySize = Requests.caseOf(r)
      .GET(path          -> 0)
      .DELETE(path       -> 0)
      .PUT((path, body)  -> body.length())
      .POST((path, body) -> body.length())

[Q] Method must have one, and only one, type variable

Consider the following example:

@Data(flavour = Flavour.JDK)
public abstract class MyMap<A, B> {

  public interface MyMapVisitor<A, B> {
    MyMap<A, B> EmptyMap();
    MyMap<A, B> Insert(Pair<A, B> argPair, MyMap<A, B> argMap);
  }

  public abstract MyMap<A, B> match(MyMapVisitor<A, B> visitor);

}

and I get the following on mvn clean compile:

Method must have one, and only one, type variable (without bounds) 
that should also be the method return type.

which is from AdtParser#144.

What am I doing wrong?

Always use a static method named 'cases' for intiating pattern-matching

It well be more consistent with usual FP terminology. Currently we re-use name of the 'match' method. but it is misleading as pattern matching return a function.

Auto Type unification might not always be wanted.

I've noticed when having something like.

 . . .
    interface Cases<R,A> {
        R DoSomething(String x, F<Integer,A> k);
    }
 . . .

That it generates a Type constructor DoSomething(String x) without the k. It uses the Identity function for us.

However this might not always be the only function we would want to use.

For Example. Defining an F-algebra for the use in a free monad. Then defining a Functor instance for the F-algrebra becomes impossible, because we do not have DoSomething(String x, F<Integer,A> k) type constructor.

Maybe auto-generate both type constructors?

Generate Javadoc for all generated methods

Method generated with package private visibility are to be suffixed by 0

So that they won't clash with method reexported as public.
Address #44 (comment)

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.

Jobs

Jooble