Commit e860788

Merge pull request #5 from BinaryIgor/better-docs
Better docs & API
2 parents 0ab31a8 + 4967191 commit e860788

28 files changed

Lines changed: 646 additions & 237 deletions

README.md

Lines changed: 201 additions & 15 deletions
@@ -12,7 +12,8 @@ For scalability details, see [benchmarks](/benchmarks/README.md).
 
 ## How it works
 
-We just need to have three tables:
+We just need to have three tables (postgres syntax):
+
 ```sql
 CREATE TABLE topic (
     name TEXT PRIMARY KEY,
@@ -27,7 +28,7 @@ CREATE TABLE event (
     key TEXT,
     value BYTEA NOT NULL,
     created_at TIMESTAMP NOT NULL DEFAULT NOW(),
-    metadata JSONB NOT NULL,
+    metadata JSON NOT NULL,
     PRIMARY KEY (topic, id)
 ) PARTITION BY LIST (topic);
 
@@ -43,29 +44,35 @@ CREATE TABLE consumer (
 ```
 
 To consume messages, we just need to periodically (every one to a few seconds) do:
+
 ```sql
 BEGIN;
 
 SELECT * FROM consumer
-WHERE topic = : topic AND name = :c_name
+WHERE topic = :topic AND name = :c_name
 FOR UPDATE SKIP LOCKED;
 
 SELECT * FROM event
-WHERE (:last_event_id IS NULL) OR id > last_event_id
-ORDER BY id LIMIT N;
+WHERE topic = :topic AND (:last_event_id IS NULL OR id > :last_event_id)
+ORDER BY id LIMIT :limit;
 
 (process events)
 
 UPDATE consumer
 SET last_event_id = :id,
     last_consumption_at = :now
 WHERE topic = :topic AND name = :c_name;
+
+COMMIT;
 ```
 
-Optionally, to increase throughput & concurrency, we might have partitioned topic and consumers (-1 partition standing for not partitioned topic/consumer).
+Optionally, to increase throughput & concurrency, we might have a partitioned topic and consumers (-1 partition standing
+for not partitioned topic/consumer/event).
+
+Distribution of partitioned events is the sole responsibility of the publisher.
+
+Consumption of such events per partition (0 in the example) might look like this:
 
-Distribution of partitioned events is a sole responsibility of publisher - the library provides sensible default (random distribution).
-Consumption of such events per partition (0 in example) might look like this:
 ```sql
 BEGIN;
 
@@ -74,26 +81,205 @@ WHERE topic = :topic AND name = :c_name AND partition = 0
 FOR UPDATE SKIP LOCKED;
 
 SELECT * FROM event
-WHERE (:last_event_id IS NULL) OR id > last_event_id AND partition = 0
-ORDER BY id LIMIT N;
+WHERE topic = :topic AND partition = 0 AND (:last_event_id IS NULL OR id > :last_event_id)
+ORDER BY id LIMIT :limit;
 
 (process events)
 
 UPDATE consumer
 SET last_event_id = :id,
     last_consumption_at = :now
 WHERE topic = :topic AND name = :c_name AND partition = 0;
+
+COMMIT;
 ```
 
-Limitation being that if consumer is partitioned, it must have the exact same number of partition as in the topic
-definition.
-It's a rather acceptable tradeoff and easy to enforce at the library level.
+The limitation being that if a consumer is partitioned, it must have exactly the same number of partitions as the topic
+definition has. It's a rather acceptable tradeoff and easy to enforce at the library level.
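
An editorial aside on the consumption loop above: because events are fetched strictly by `id > last_event_id` and the offset is only advanced after processing, a crash before the `UPDATE` re-delivers the batch, giving at-least-once semantics. A minimal, hypothetical in-memory sketch of that offset-tracking logic (not library code, names are illustrative):

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// In-memory stand-in for the event/consumer tables; ConsumerOffsetSketch is
// an illustrative name, not part of the library.
public class ConsumerOffsetSketch {

    private final TreeMap<Long, String> events = new TreeMap<>();
    // analogous to consumer.last_event_id (NULL is modeled as -1 here)
    private long lastEventId = -1;

    public void publish(long id, String value) {
        events.put(id, value);
    }

    // one polling iteration: SELECT ... WHERE id > :last_event_id ORDER BY id LIMIT :limit
    public List<String> poll(int limit) {
        var batch = events.tailMap(lastEventId, false).entrySet().stream()
                .limit(limit)
                .toList();
        if (!batch.isEmpty()) {
            // UPDATE consumer SET last_event_id = :id (id of the last processed event)
            lastEventId = batch.get(batch.size() - 1).getKey();
        }
        return batch.stream().map(Map.Entry::getValue).toList();
    }
}
```

If `poll` is interrupted before `lastEventId` is advanced, the same events come back on the next call, which is exactly the re-delivery behavior the transactional SQL loop above provides.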
 
 ## How to use it
 
-TODO: for now, check out benchmarks/app being an example app.
+`EventSQL` is an entrypoint to the whole library. It requires a standard Java `javax.sql.DataSource` or a list of
+them:
+
+```java
+
+import com.binaryigor.eventsql.EventSQL;
+import javax.sql.DataSource;
+// dialect of your events backend - POSTGRES, MYSQL, MARIADB and so on;
+// as of now, only POSTGRES has fully tested support;
+// others should also work, but some things - event table partition management, for example - work only with Postgres; for other dbs, it must be managed manually
+import org.jooq.SQLDialect;
+
+var eventSQL = new EventSQL(dataSource, SQLDialect.POSTGRES);
+var shardedEventSQL = new EventSQL(dataSources, SQLDialect.POSTGRES);
+```
+
+The sharded version works in the same vein - it just assumes that topics and consumers are hosted on multiple dbs.
+
+### Topics and Consumers
+
+Having an `EventSQL` instance, we can register topics and their consumers:
+
+```java
+// all operations are idempotent
+eventSQL.registry()
+        // -1 stands for not partitioned topic
+        .registerTopic(new TopicDefinition("account_created", -1))
+        .registerTopic(new TopicDefinition("invoice_issued", 5))
+        // third argument (true/false) determines whether the consumer is partitioned or not
+        .registerConsumer(new ConsumerDefinition("account_created", "consumer-1", false))
+        .registerConsumer(new ConsumerDefinition("invoice_issued", "consumer-2", true));
+```
+
+Topics and consumers can both be partitioned and not partitioned.
+**Partitioned topics allow for partitioned consumers, increasing parallelism.**
+Parallelism of partitioned consumers is as high as the consumed topic's number of partitions - events have an ordering guarantee within a partition.
+As a consequence, for a given consumer, each partition can be processed only by a single thread at a time.
+
+For a consumer to be partitioned, its topic must be partitioned as well - it will have the same number of partitions.
+The opposite does not have to be true - a consumer might not be partitioned while its topic is; this has performance implications though, since, as described above, consumer parallelism is capped at its number of partitions.
+
+**With sharding, partitions are multiplied by the number of shards (db instances).**
+
+For example, if we have *3 shards (3 dbs) and a topic with 10 partitions - each shard (db) will host 10 partitions, giving 30 partitions in total*.
+Same with consumers of a sharded topic - they will all be multiplied by the number of shards.
+
+For events, it works differently - in the example above, *each shard will host ~33% (1/3) of the topic's event data* (assuming even partition distribution).
+To get all events, we must read them from all shards.
+
+There will be *30 consumer instances* in this particular case - `3 shards * 10 partitions`; each consuming from one partition hosted on a given shard.
+Each event will be published to one partition of a single shard - as a consequence, events are globally unique, across all shards.
+
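
An editorial aside on the sharding arithmetic above: how the library actually routes an event to a shard is not shown in this README, so the sketch below only illustrates the described outcome (`3 shards * 10 partitions = 30`, each event landing on one partition of a single shard) under the assumption of a stable hash split; `ShardRoutingSketch` and its methods are hypothetical names:

```java
import java.math.BigInteger;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

public class ShardRoutingSketch {

    // "partitions are multiplied by the number of shards": 3 * 10 = 30
    public static int totalPartitions(int shards, int partitionsPerTopic) {
        return shards * partitionsPerTopic;
    }

    // assumed routing: one stable hash, split across shards and then partitions
    public static int[] route(String key, int shards, int partitions) {
        var hash = keyHash(key);
        var shard = hash.mod(BigInteger.valueOf(shards)).intValue();
        var partition = hash.mod(BigInteger.valueOf(partitions)).intValue();
        // one partition of a single shard - events stay globally unique
        return new int[]{shard, partition};
    }

    static BigInteger keyHash(String key) {
        try {
            var digest = MessageDigest.getInstance("SHA-256")
                    .digest(key.getBytes(StandardCharsets.UTF_8));
            return new BigInteger(1, digest); // non-negative
        } catch (NoSuchAlgorithmException e) {
            throw new RuntimeException(e);
        }
    }
}
```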
+### Publishing
+
+We can publish single events and batches of arbitrary data and type:
+```java
+var publisher = eventSQL.publisher();
+
+publisher.publish(new EventPublication("txt_topic", "txt event".getBytes(StandardCharsets.UTF_8)));
+publisher.publish(new EventPublication("raw_topic", new byte[]{1, 2, 3}));
+publisher.publish(new EventPublication("json_topic",
+        """
+        {
+          "id": 2,
+          "name": "some-user"
+        }
+        """.getBytes(StandardCharsets.UTF_8)));
+
+// events can have keys and metadata as well;
+// key determines event distribution - if it's null, partition is randomly assigned
+publisher.publish(new EventPublication("txt_topic",
+        "event-key",
+        "txt event".getBytes(StandardCharsets.UTF_8),
+        Map.of("some-tag", "some-meta-info")));
+
+
+// events can also be published in batches, for improved throughput
+publisher.publishAll(List.of(
+        new EventPublication("txt_topic", "txt event 1".getBytes(StandardCharsets.UTF_8)),
+        new EventPublication("txt_topic", "txt event 2".getBytes(StandardCharsets.UTF_8)),
+        new EventPublication("txt_topic", "txt event 3".getBytes(StandardCharsets.UTF_8))));
+```
+
+### Partitioner
+
+Event partition is determined by `EventSQLPublisher.Partitioner`. By default, the following implementation is used:
+```java
+public class DefaultPartitioner implements EventSQLPublisher.Partitioner {
+
+    private static final Random RANDOM = new Random();
+
+    @Override
+    public int partition(EventPublication publication, int topicPartitions) {
+        if (topicPartitions == -1) {
+            return -1;
+        }
+        if (publication.key() == null) {
+            return RANDOM.nextInt(topicPartitions);
+        }
+
+        return keyHash(publication.key())
+                .mod(BigInteger.valueOf(topicPartitions))
+                .intValue();
+    }
+
+    ...
+```
+
+For a not partitioned topic, no partition is assigned.
+
+If the topic is partitioned and the event has a null key - the partition is random.
+If a key is defined, the partition is assigned based on the key hash. Thanks to this, we have a guarantee that events associated with the same key always land in the same partition.
+
+If you want to change this behavior, you can provide your own implementation and configure it by calling the `EventSQLPublisher.configurePartitioner` method.
+
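
An editorial aside: the `keyHash` helper is elided in the snippet above, so the standalone version below assumes a SHA-256 based hash - the library's real function may differ, but any stable hash yields the same "same key, same partition" guarantee:

```java
import java.math.BigInteger;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Random;

// Standalone sketch of the DefaultPartitioner logic; PartitionerSketch and
// the assumed SHA-256 keyHash are illustrative, not library code.
public class PartitionerSketch {

    private static final Random RANDOM = new Random();

    public static int partition(String key, int topicPartitions) {
        if (topicPartitions == -1) {
            return -1; // not partitioned topic - no partition assigned
        }
        if (key == null) {
            return RANDOM.nextInt(topicPartitions); // null key - random partition
        }
        // keyed events always land in the same partition
        return keyHash(key).mod(BigInteger.valueOf(topicPartitions)).intValue();
    }

    static BigInteger keyHash(String key) {
        try {
            var digest = MessageDigest.getInstance("SHA-256")
                    .digest(key.getBytes(StandardCharsets.UTF_8));
            return new BigInteger(1, digest); // non-negative
        } catch (NoSuchAlgorithmException e) {
            throw new RuntimeException(e);
        }
    }
}
```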
+### Consuming
+
+We can have both single event and batch consumers:
+```java
+var consumers = eventSQL.consumers();
+
+consumers.startConsumer("txt_topic", "single-consumer", event -> {
+    // handle single event
+});
+// with more frequent polling - by default, it is 1 second
+consumers.startConsumer("txt_topic", "single-consumer-customized", event -> {
+    // handle single event
+}, Duration.ofMillis(100));
+
+consumers.startBatchConsumer("txt_topic", "batch-consumer", events -> {
+    // handle events batch for better performance
+}, // customize batch behavior:
+   // minEvents, maxEvents,
+   // pollingDelay and maxPollingDelay - how long to wait for minEvents
+   EventSQLConsumers.ConsumptionConfig.of(5, 100,
+           Duration.ofSeconds(1), Duration.ofSeconds(10)));
+```
+
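
An editorial aside on `ConsumptionConfig.of(minEvents, maxEvents, pollingDelay, maxPollingDelay)`: the semantics below are an assumption inferred from the comments above (deliver once `minEvents` have accumulated, give up waiting after `maxPollingDelay`, never exceed `maxEvents` per batch) - not the library's actual scheduling code; `BatchReadinessSketch` is a hypothetical name:

```java
import java.time.Duration;

public class BatchReadinessSketch {

    // assumed rule: consume early once minEvents arrived, or stop waiting
    // for more events after maxPollingDelay has elapsed
    public static boolean shouldConsume(int buffered, int minEvents,
                                        Duration waited, Duration maxPollingDelay) {
        return buffered >= minEvents
                || (buffered > 0 && waited.compareTo(maxPollingDelay) >= 0);
    }

    // a single delivered batch is capped at maxEvents
    public static int batchSize(int buffered, int maxEvents) {
        return Math.min(buffered, maxEvents);
    }
}
```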
+### Dead Letter Topics (DLT)
+
+If we register a topic with a DLT as follows:
+```java
+eventSQL.registry()
+        .registerTopic(new TopicDefinition("account_created", -1))
+        .registerTopic(new TopicDefinition("account_created_dlt", -1));
+```
+Under certain circumstances, it will get special treatment.
+
+When a consumer throws `EventSQLConsumptionException`, `DefaultDLTEventFactory` takes over and publishes the failed event to the associated DLT, if it can find one:
+```java
+...
+
+@Override
+public Optional<EventPublication> create(EventSQLConsumptionException exception, String consumer) {
+    var event = exception.event();
+
+    var dltTopic = event.topic() + "_dlt";
+    var dltTopicDefinitionOpt = topicDefinitionsCache.getLoadingIf(dltTopic, true);
+    if (dltTopicDefinitionOpt.isEmpty()) {
+        return Optional.empty();
+    }
+
+    ...
+
+    // creates dlt event
+```
+
+This factory can be customized by calling the `EventSQLConsumers.configureDLTEventFactory` method.
+
+What is also worth noting is that any exception thrown by a single event consumer is automatically wrapped into `EventSQLConsumptionException` - see the `ConsumerWrapper` class.
+
+When you use `EventSQLConsumers.startBatchConsumer`, you have to do the wrapping yourself.
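
An editorial aside: the DLT resolution visible in the `DefaultDLTEventFactory` snippet above boils down to a naming convention - the DLT for a topic is `"<topic>_dlt"`, used only if such a topic is registered. A minimal sketch of just that rule (`DLTResolutionSketch` is an illustrative name, not library code):

```java
import java.util.Optional;
import java.util.Set;

public class DLTResolutionSketch {

    public static Optional<String> resolveDLT(String topic, Set<String> registeredTopics) {
        var dltTopic = topic + "_dlt";
        // no registered DLT -> Optional.empty(), the failed event is not redirected
        return registeredTopics.contains(dltTopic) ? Optional.of(dltTopic) : Optional.empty();
    }
}
```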
 
 
 ## How to get it
 
-TODO
+Maven:
+```
+TODO: publish it
+```
+Gradle:
+```
+TODO: publish it
+```

TODO.md

Lines changed: 3 additions & 2 deletions
@@ -1,9 +1,10 @@
 * usage examples
-* just pub/sub
 * giving access to event tables as a means of a simple export - since they are all there
 * expiring events/TTL?
 * compact topics - unique key
 * join, aka streams
 * more elaborate definitions change support - especially around partitions growth & shrinkage
 * JavaDocs
-* Support schemas init in registry - why require schemas from users, if it is always the same?
+* Support schemas init in registry - why require schemas from users, if it is always the same (keeping dbs diffs in mind)?
+* using JOOQ tradeoffs?
+* full MySQL/MariaDB support

benchmarks/events-db/init_db.sql

Lines changed: 1 addition & 1 deletion
@@ -19,7 +19,7 @@ CREATE TABLE event (
     key TEXT,
     value BYTEA NOT NULL,
     created_at TIMESTAMP NOT NULL DEFAULT NOW(),
-    metadata JSONB NOT NULL,
+    metadata JSON NOT NULL,
     PRIMARY KEY (topic, id)
 ) PARTITION BY LIST (topic);
 
benchmarks/runner/Dockerfile

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 FROM eclipse-temurin:21-alpine
 
-COPY target/eventsql-benchmarks-runer-jar-with-dependencies.jar /app.jar
+COPY target/eventsql-benchmarks-runner-jar-with-dependencies.jar /app.jar
 
 ENTRYPOINT ["java", "-jar", "/app.jar"]

benchmarks/runner/pom.xml

Lines changed: 1 addition & 1 deletion
@@ -51,7 +51,7 @@
                 <goal>single</goal>
             </goals>
             <configuration>
-                <finalName>eventsql-benchmarks-runer</finalName>
+                <finalName>eventsql-benchmarks-runner</finalName>
                 <archive>
                     <manifest>
                         <mainClass>com.binaryigor.eventsql.benchmarks.EventSQLBenchmarksRunner</mainClass>

benchmarks/runner/src/main/java/com/binaryigor/eventsql/benchmarks/EventSQLBenchmarksRunner.java

Lines changed: 7 additions & 14 deletions
@@ -61,7 +61,7 @@ public static void main(String[] args) throws Exception {
 
         var start = System.currentTimeMillis();
 
-        publishEvents(eventSQL.publisher(), topicDefinition);
+        publishEvents(eventSQL.publisher());
         var publicationDuration = Duration.ofMillis(System.currentTimeMillis() - start);
 
         printDelimiter();
@@ -94,11 +94,6 @@ static String envValueOrDefault(String key, String defaultValue) {
         return System.getenv().getOrDefault(key, defaultValue);
     }
 
-    static String envValueOrThrow(String key) {
-        return Optional.ofNullable(System.getenv().get(key))
-                .orElseThrow(() -> new RuntimeException("%s env variable is required but was not supplied!".formatted(key)));
-    }
-
     static int envIntValueOrDefault(String key, int defaultValue) {
         return Integer.parseInt(envValueOrDefault(key, String.valueOf(defaultValue)));
     }
@@ -173,12 +168,12 @@ static <T> T executeQuery(DataSource source, String query, ResultSetMapper<T> re
         }
     }
 
-    static void publishEvents(EventSQLPublisher publisher, TopicDefinition topicDefinition) throws Exception {
+    static void publishEvents(EventSQLPublisher publisher) throws Exception {
         var futures = new LinkedList<Future<?>>();
 
         try (var executor = Executors.newVirtualThreadPerTaskExecutor()) {
             for (var i = 0; i < EVENTS_TO_PUBLISH; i++) {
-                var result = executor.submit(() -> publishRandomEvent(publisher, topicDefinition));
+                var result = executor.submit(() -> publishRandomEvent(publisher));
                 futures.add(result);
 
                 var publications = i + 1;
@@ -198,20 +193,18 @@ static void publishEvents(EventSQLPublisher publisher, TopicDefi
             }
         }
     }
 
-    static void publishRandomEvent(EventSQLPublisher publisher, TopicDefinition topicDefinition) {
+    static void publishRandomEvent(EventSQLPublisher publisher) {
         try {
             // make publication more evenly distributed in time
             Thread.sleep(RANDOM.nextInt(1000));
-            var partition = RANDOM.nextInt(topicDefinition.partitions());
-            var event = nextEvent(partition);
-            publisher.publish(event);
+            publisher.publish(nextEvent());
         } catch (Exception e) {
             e.printStackTrace();
         }
     }
 
-    static EventPublication nextEvent(int partition) {
-        return new EventPublication(TEST_TOPIC, partition, accountCreatedEventJson().getBytes(StandardCharsets.UTF_8));
+    static EventPublication nextEvent() {
+        return new EventPublication(TEST_TOPIC, accountCreatedEventJson().getBytes(StandardCharsets.UTF_8));
     }
 
     static String accountCreatedEventJson() {
