Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(clp-s): Add boilerplate for SQL parsing. #504

Merged
merged 30 commits into from
Jan 21, 2025
Merged
Show file tree
Hide file tree
Changes from 14 commits
Commits
Show all changes
30 commits
Select commit Hold shift + click to select a range
b41eed1
Add boilerplate for new sql parser
gibber9809 Jul 31, 2024
86acbf5
Fix lint
gibber9809 Jul 31, 2024
93cb37c
Apply suggestions from code review
gibber9809 Aug 2, 2024
b5f55d3
Merge branch 'main' into basic-sql
gibber9809 Aug 2, 2024
d96505e
Move antlr visitor derived classes into private namespace and dedupli…
gibber9809 Aug 9, 2024
436c396
Apply suggestions from code review
gibber9809 Aug 12, 2024
4037d5b
Remove duplicated code
gibber9809 Aug 12, 2024
2b5db82
Merge remote-tracking branch 'upstream/main' into basic-sql
gibber9809 Sep 16, 2024
f8d67fb
Fix clang-tidy warnings
gibber9809 Sep 16, 2024
ce70d88
Undo incorrect clang-tidy suggestion
gibber9809 Sep 16, 2024
0f768a5
More clang-tidy fixes
gibber9809 Sep 16, 2024
a1d8fe7
Fix compilation error
gibber9809 Sep 22, 2024
1805505
Apply suggestions from code review
gibber9809 Sep 25, 2024
4af7f86
Lint fix
gibber9809 Sep 25, 2024
d09e6c4
Fix macOS build
gibber9809 Sep 25, 2024
991ca73
Apply suggestions from code review
gibber9809 Oct 22, 2024
ce13759
Update components/core/src/clp_s/search/sql/sql.cpp
gibber9809 Oct 22, 2024
5696ac3
Address more review comments
gibber9809 Oct 22, 2024
712afbe
Add comment indicating sql grammar is incomplete boilerplate
gibber9809 Oct 22, 2024
c2432bb
Minor fix
gibber9809 Oct 23, 2024
cd7f164
Update components/core/src/clp_s/search/sql/sql.cpp
gibber9809 Oct 23, 2024
d687fe4
Merge remote-tracking branch 'upstream/main' into basic-sql
gibber9809 Nov 7, 2024
8f00de2
Merge remote-tracking branch 'upstream/main' into basic-sql
gibber9809 Nov 13, 2024
29f8744
Remove spdlog dependency to fix build issue on macOS
gibber9809 Nov 18, 2024
f0118f1
Revert "Remove spdlog dependency to fix build issue on macOS"
gibber9809 Nov 18, 2024
866dcfc
Attempt to fix macOS build issues
gibber9809 Nov 18, 2024
e17ebd0
Merge remote-tracking branch 'upstream/main' into basic-sql
gibber9809 Nov 18, 2024
012915f
Merge remote-tracking branch 'upstream/main' into basic-sql
gibber9809 Jan 21, 2025
fbb28eb
Fix strange macos build issue
gibber9809 Jan 21, 2025
b0e1a7c
lint fix
gibber9809 Jan 21, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions components/core/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -519,6 +519,7 @@ set(SOURCE_FILES_unitTest
tests/test-Stopwatch.cpp
tests/test-StreamingCompression.cpp
tests/test-string_utils.cpp
tests/test-sql.cpp
tests/test-TimestampPattern.cpp
tests/test-utf8_utils.cpp
tests/test-Utils.cpp
Expand All @@ -539,6 +540,7 @@ target_link_libraries(unitTest
LibArchive::LibArchive
MariaDBClient::MariaDBClient
spdlog::spdlog
sql
OpenSSL::Crypto
${sqlite_LIBRARY_DEPENDENCIES}
${STD_FS_LIBS}
Expand Down
1 change: 1 addition & 0 deletions components/core/src/clp_s/CMakeLists.txt
Original file line number Diff line number Diff line change
@@ -1,4 +1,5 @@
add_subdirectory(search/kql)
add_subdirectory(search/sql)

set(
CLP_SOURCES
Expand Down
36 changes: 36 additions & 0 deletions components/core/src/clp_s/search/antlr_common/ErrorListener.hpp
Original file line number Diff line number Diff line change
@@ -0,0 +1,36 @@
#ifndef CLP_S_SEARCH_ANTLRCOMMON_ERRORLISTENER_HPP
#define CLP_S_SEARCH_ANTLRCOMMON_ERRORLISTENER_HPP

#include <cstddef>
#include <exception>
#include <string>
#include <string_view>

#include <antlr4-runtime.h>
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved

namespace clp_s::search::antlr_common {
class ErrorListener : public antlr4::BaseErrorListener {
public:
auto syntaxError(
[[maybe_unused]] antlr4::Recognizer* recognizer,
[[maybe_unused]] antlr4::Token* offending_symbol,
[[maybe_unused]] size_t line,
[[maybe_unused]] size_t char_position_in_line,
std::string const& msg,
[[maybe_unused]] std::exception_ptr e
) -> void override {
m_error = true;
m_error_message = msg;
}

[[nodiscard]] auto error() const -> bool { return m_error; }

[[nodiscard]] auto message() const -> std::string_view { return m_error_message; }

private:
bool m_error{false};
std::string m_error_message;
};
} // namespace clp_s::search::antlr_common

#endif // CLP_S_SEARCH_ANTLRCOMMON_ERRORLISTENER_HPP
1 change: 1 addition & 0 deletions components/core/src/clp_s/search/kql/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,7 @@ ANTLR_TARGET(
add_library(
kql
../../Utils.hpp
../antlr_common/ErrorListener.hpp
../AndExpr.hpp
../BooleanLiteral.hpp
../ColumnDescriptor.hpp
Expand Down
36 changes: 7 additions & 29 deletions components/core/src/clp_s/search/kql/kql.cpp
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I will probably leave the change of kql to the next PR. Maybe ask Kirk's advice @kirkrodrigues

Original file line number Diff line number Diff line change
Expand Up @@ -5,14 +5,9 @@
#include <antlr4-runtime.h>
#include <spdlog/spdlog.h>

#include "KqlBaseVisitor.h"
#include "KqlLexer.h"
#include "KqlParser.h"
// If redlining may want to add ${workspaceFolder}/build/**
// to include path for vscode C/C++ utils

#include "../../Utils.hpp"
#include "../AndExpr.hpp"
#include "../antlr_common/ErrorListener.hpp"
#include "../BooleanLiteral.hpp"
#include "../ColumnDescriptor.hpp"
#include "../DateLiteral.hpp"
Expand All @@ -22,34 +17,16 @@
#include "../NullLiteral.hpp"
#include "../OrExpr.hpp"
#include "../StringLiteral.hpp"
#include "KqlBaseVisitor.h"
#include "KqlLexer.h"
#include "KqlParser.h"

using namespace antlr4;
using namespace kql;
using clp_s::search::antlr_common::ErrorListener;

namespace clp_s::search::kql {
class ErrorListener : public BaseErrorListener {
public:
void syntaxError(
Recognizer* recognizer,
Token* offending_symbol,
size_t line,
size_t char_position_in_line,
std::string const& msg,
std::exception_ptr e
) override {
m_error = true;
m_error_message = msg;
}

bool error() const { return m_error; }

std::string const& message() const { return m_error_message; }

private:
bool m_error{false};
std::string m_error_message;
};

namespace {
class ParseTreeVisitor : public KqlBaseVisitor {
private:
static void
Expand Down Expand Up @@ -224,6 +201,7 @@ class ParseTreeVisitor : public KqlBaseVisitor {
return base;
}
};
} // namespace

std::shared_ptr<Expression> parse_kql_expression(std::istream& in) {
ErrorListener lexer_error_listener;
Expand Down
29 changes: 29 additions & 0 deletions components/core/src/clp_s/search/sql/CMakeLists.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
ANTLR_TARGET(
SqlParser
Sql.g4
LEXER PARSER VISITOR
PACKAGE sql
)

add_library(
sql
../../Utils.hpp
../antlr_common/ErrorListener.hpp
../AndExpr.hpp
../BooleanLiteral.hpp
../ColumnDescriptor.hpp
../DateLiteral.hpp
../EmptyExpr.hpp
../Expression.hpp
../FilterExpr.hpp
../Integral.hpp
../NullLiteral.hpp
../OrExpr.hpp
../StringLiteral.hpp
${ANTLR_SqlParser_CXX_OUTPUTS}
sql.cpp
sql.hpp
)
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved
target_compile_features(sql PRIVATE cxx_std_20)
target_include_directories(sql PRIVATE ${ANTLR_SqlParser_OUTPUT_DIR})
target_link_libraries(sql PRIVATE antlr4_static)
5 changes: 5 additions & 0 deletions components/core/src/clp_s/search/sql/Sql.g4
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we add a comment explaining this is a boilerplate? Asked Kirk to take a brief view and his feedback is this file can be confusing to people new to it without a comment explaining it's a WIP config

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fwiw, the Rabbit agrees with me, lol.

Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
grammar Sql;

start: EOF ;

SPACE: [ \t\r\n] -> skip ;
59 changes: 59 additions & 0 deletions components/core/src/clp_s/search/sql/sql.cpp
Original file line number Diff line number Diff line change
@@ -0,0 +1,59 @@
#include <any>
#include <iostream>
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved
#include <memory>

#include <antlr4-runtime.h>
#include <spdlog/spdlog.h>

#include "../antlr_common/ErrorListener.hpp"
#include "../EmptyExpr.hpp"
#include "../Expression.hpp"
#include "SqlBaseVisitor.h"
#include "SqlLexer.h"
#include "SqlParser.h"

using antlr4::ANTLRInputStream;
using antlr4::CommonTokenStream;
using clp_s::search::antlr_common::ErrorListener;
using sql::SqlBaseVisitor;
using sql::SqlLexer;
using sql::SqlParser;

namespace clp_s::search::sql {
namespace {
class ParseTreeVisitor : public SqlBaseVisitor {
public:
[[nodiscard]] auto visitStart([[maybe_unused]] SqlParser::StartContext* ctx
) -> std::any override {
return EmptyExpr::create();
}
};
} // namespace

auto parse_sql_expression(std::istream& in) -> std::shared_ptr<Expression> {
ErrorListener lexer_error_listener;
ErrorListener parser_error_listener;

ANTLRInputStream input{in};
SqlLexer lexer(&input);
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved
lexer.removeErrorListeners();
lexer.addErrorListener(&lexer_error_listener);
CommonTokenStream tokens(&lexer);
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved
SqlParser parser(&tokens);
parser.removeErrorListeners();
parser.addErrorListener(&parser_error_listener);
SqlParser::StartContext* tree = parser.start();
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved

if (lexer_error_listener.error()) {
SPDLOG_ERROR("Lexer error: {}", lexer_error_listener.message());
return nullptr;
}
if (parser_error_listener.error()) {
SPDLOG_ERROR("Parser error: {}", parser_error_listener.message());
return {};
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved
}

ParseTreeVisitor visitor;
return std::any_cast<std::shared_ptr<Expression>>(visitor.visitStart(tree));
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved
}
} // namespace clp_s::search::sql
18 changes: 18 additions & 0 deletions components/core/src/clp_s/search/sql/sql.hpp
Original file line number Diff line number Diff line change
@@ -0,0 +1,18 @@
#ifndef CLP_S_SEARCH_SQL_SQL_HPP
#define CLP_S_SEARCH_SQL_SQL_HPP

#include <istream>
#include <memory>

#include "../Expression.hpp"

namespace clp_s::search::sql {
/**
* Parses an SQL expression from the given stream to generate a search AST.
* @param in Input stream containing an SQL expression followed by EOF
* @return a search AST on success, nullptr otherwise
*/
[[nodiscard]] auto parse_sql_expression(std::istream& in) -> std::shared_ptr<Expression>;
} // namespace clp_s::search::sql

#endif // CLP_S_SEARCH_SQL_SQL_HPP
24 changes: 24 additions & 0 deletions components/core/tests/test-sql.cpp
Original file line number Diff line number Diff line change
@@ -0,0 +1,24 @@
#include <memory>
#include <sstream>
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved

#include <Catch2/single_include/catch2/catch.hpp>
#include <spdlog/spdlog.h>
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved

#include "../src/clp_s/search/EmptyExpr.hpp"
#include "../src/clp_s/search/sql/sql.hpp"
#include "LogSuppressor.hpp"

using clp_s::search::EmptyExpr;
using clp_s::search::sql::parse_sql_expression;
using std::stringstream;

TEST_CASE("Test parsing SQL", "[SQL]") {
// Suppress logging
LogSuppressor suppressor{};
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Not used?

Copy link
Contributor Author

@gibber9809 gibber9809 Sep 25, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Its use is entirely implemented in the constructor/destructor. It just turns off spdlog logging once created, and re-enables it once destroyed.

Just allows you to run parsing unit tests without emitting tons of log messages for parsing failure.

gibber9809 marked this conversation as resolved.
Show resolved Hide resolved

SECTION("Stub accepts empty string") {
stringstream empty_string{""};
auto filter = std::dynamic_pointer_cast<EmptyExpr>(parse_sql_expression(empty_string));
REQUIRE(nullptr != filter);
gibber9809 marked this conversation as resolved.
Show resolved Hide resolved
}
}
Loading