I'm not a computer professional; I just want the Miniflux instance I built to be able to search more content.
Deployment method
For both pg_cjk_parser and zhparser, I built a Docker image and ran it through docker-compose.yml.
Test search content
I ran Miniflux against two databases, one with pg_cjk_parser and one with zhparser, imported the same RSS feeds into both, and forced a refresh. After waiting a while, the pg_cjk_parser instance had 7829 unread items and the zhparser instance had 7824.
I then searched in both Chinese and English and found that pg_cjk_parser returned far fewer results than zhparser. For example, a search for Hangzhou showed 28 entries with pg_cjk_parser and 123 entries with zhparser. English searches also differed between the two. In general, pg_cjk_parser finds fewer entries.
Question
I have installed several extensions on the zhparser instance. Could these have any impact on the results?
ARG POSTGRES_VERSION=16
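FROM postgres:$POSTGRES_VERSION AS build
# Redeclare the build argument so it is visible inside this stage.
ARG POSTGRES_VERSION
RUN apt-get update && apt-get install -y --no-install-recommends ca-certificates git postgresql-server-dev-$POSTGRES_VERSION gcc make icu-devtools libicu-dev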
RUN mkdir -p /root/parser
WORKDIR /root/parser
RUN git clone https://github.com/huangjimmy/pg_cjk_parser.git /tmp/pg_cjk_parser && \
    cp /tmp/pg_cjk_parser/pg_cjk_parser.c /tmp/pg_cjk_parser/pg_cjk_parser.control /tmp/pg_cjk_parser/Makefile /tmp/pg_cjk_parser/pg_cjk_parser--0.0.1.sql /tmp/pg_cjk_parser/zht2zhs.h /root/parser/ && \
    make clean && make install
FROM postgres:$POSTGRES_VERSION
ARG POSTGRES_VERSION=16
COPY --from=build /root/parser/pg_cjk_parser.bc /usr/lib/postgresql/$POSTGRES_VERSION/lib/bitcode
COPY --from=build /root/parser/pg_cjk_parser.so /usr/lib/postgresql/$POSTGRES_VERSION/lib
COPY --from=build /root/parser/pg_cjk_parser--0.0.1.sql /usr/share/postgresql/$POSTGRES_VERSION/extension
COPY --from=build /root/parser/pg_cjk_parser.control /usr/share/postgresql/$POSTGRES_VERSION/extension
Build the image and start the container, then log in with pgAdmin and configure the database connection.
Select CREATE Script on the postgres database; a new query window opens.
Clear the window contents and enter:
CREATE TEXT SEARCH PARSER public.pg_cjk_parser (
START = prsd2_cjk_start,
GETTOKEN = prsd2_cjk_nexttoken,
END = prsd2_cjk_end,
LEXTYPES = prsd2_cjk_lextype,
HEADLINE = prsd2_cjk_headline);
CREATE TEXT SEARCH CONFIGURATION public.config_2_gram_cjk (
PARSER = pg_cjk_parser
);
SET default_text_search_config = 'public.config_2_gram_cjk';
Click Run, clear the input, and enter:
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR asciihword
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR cjk
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR email
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR asciiword
WITH english_stem;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR entity
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR file
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR float
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR host
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR hword
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR hword_asciipart
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR hword_numpart
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR hword_part
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR int
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR numhword
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR numword
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR protocol
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR sfloat
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR tag
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR uint
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR url
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR url_path
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR version
WITH simple;
ALTER TEXT SEARCH CONFIGURATION public.config_2_gram_cjk
ADD MAPPING FOR word
WITH simple;
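The SET statement above changes default_text_search_config only for the session that runs it, so other connections keep whatever default the database already has. A minimal sketch of how the setting could be checked and made persistent is shown here. The database name miniflux is an assumption, not something taken from this setup; replace it with the database Miniflux actually connects to.

```sql
-- Show the configuration the current session resolves by default.
SHOW default_text_search_config;

-- Persist the default for new sessions on one database.
-- "miniflux" is a hypothetical database name; substitute your own.
ALTER DATABASE miniflux
    SET default_text_search_config = 'public.config_2_gram_cjk';
```

New sessions pick up the value set by ALTER DATABASE, so existing connections would need to reconnect before the change is visible to them.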
Log in to Miniflux through the normal installation process and import RSS feeds.
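To narrow down why the two setups return different counts, it may help to compare how each configuration tokenizes the same text. The queries below are a sketch that uses PostgreSQL's built-in ts_debug, to_tsvector and plainto_tsquery functions. The sample strings are made up for illustration, and the same queries can be repeated on the zhparser database with its own configuration name in place of public.config_2_gram_cjk.

```sql
-- List each token, its type, and the lexemes produced by the mapped dictionary.
SELECT alias, token, lexemes
FROM ts_debug('public.config_2_gram_cjk', '杭州西湖 Hangzhou');

-- Check whether a sample document matches a sample query under this configuration.
SELECT to_tsvector('public.config_2_gram_cjk', '我在杭州工作')
       @@ plainto_tsquery('public.config_2_gram_cjk', '杭州') AS matches;
```

If the token lists differ noticeably between the two databases, the gap in search results is more likely to come from tokenization than from the extra extensions.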