The landing page for fashion—shows, products, designers, and more.
https://clothes.nicholaschiang.com
...contains the Remix UI for the project. Built with Remix, React, Tailwind, Postgres (with Prisma ORM deployed on Supabase), and deployed on Vercel.
https://collections.nicholaschiang.com
...contains the SvelteKit UI for the project. This is still under construction. I aim to use an in-browser database (either IndexedDB or a SQLite database in OPFS) to store data. This will be a completely static deployment (likely on Vercel for convenience) which will save money on servers (as there will be none). Everything will run on the client's device and thus should feel instantaneous.
...contains the data collection scripts used to scrape fashion data from the web. Built with Python (Scrapy) and some JS scripts (mostly due to my familiarity with JS) that are run with Node.js.
The /shows
feature of this project is Rotten Tomatoes for fashion.
The Critic Score—based on the opinions of hundreds of fashion critics—is a trusted measurement of critical recommendation for millions of consumers.
The Critic Score represents the percentage of professional critic reviews that are positive for a given runway show. A Critic Score is calculated for a runway show after it receives at least five reviews.
When at least 60% of reviews for a runway show are positive, a blossoming rose is displayed to indicate its Positive status.
When less than 60% of reviews for a runway show are positive, a dead rose is displayed to indicate its Negative status.
When there is no Critic Score available, which could be because the show hasn’t aired yet or there are not enough ratings to generate a score, a gray rose is displayed.
I collect runway show pictures, reviews, and related data every week, generating Critic Scores.
I manually copy-and-paste information into static
and import that data into Postgres using a sequence of scripts.
I often use machine learning tools to assist in the curation, summarization, and sentiment analysis process.
A more advanced web scraping setup is probably appropriate, but doing so would be quite difficult given the anti-bot measures most fashion websites employ.
The Consumer Score represents the percentage of users who have rated a runway show or collection positively. Right now, this is open to the public: anyone can create an account and submit their opinion. Eventually, I may gate this ability behind a purchase: users must first verify that they have bought a product before they can be counted amongst the "consumers" and submit a review.
The ultimate goal of this project is to create an interface that I can use to objectively rank and queue up high fashion purchases. I want an organized browser of the world of fashion.
It would be super cool IMO to open a product page and, in addition to seeing sizing, images, modeling, prices (from the various retailers that stock said item), one could also see the designer who made it (and easily see what else that designer has made and what other companies that designer has worked for), the company that owns the product's brand (and recent financial news regarding said corporation), etc.
It could also be cool to be able to match up similar products from different brands (as most brands' ready-to-wear collections will typically follow similar trends and thus contain similar pieces) in a side-by-side feature comparison view (similar to how you can compare different Apple products before making a decision on which one to buy): e.g. you could open up a Dior sweater and a GUCCI sweater side-by-side and make your purchase decision not only based on price or material sourcing or factory location or designer but also by the brand's reputation index (a.k.a. if resale value might increase) or a corporations climate change policies (or the lack thereof).
To implement features like that, I'll need a really good database schema (see #1). And then all that needs to be done is:
- figuring out where and how to scrape all of this data (ideally I want all past collections as well);
- building a basic page-based application with all of this data (perhaps using ISG from Next.js or similar).
The initial MVP will only contain a few pages (see #4):
- a basic
/products
page that lets users see a massive list of all the products in our database (and filter them using Linear style filters... filter sets can be saved as "views"); - a
/products/{id}
page that lets users see all the details of a particular product (prices, sizes, collections, variants, brands, companies, countries, runway shows, etc); - a
/views
page that shows a list of the user's views that they've saved from/products
filters (these are essentially just sets of filters); - a
/views/{id}
page that is the same as the/products
page but clicking the "Save" button updates the view instead of creating a new view (and trying to navigate away from the page after changing filters and not saving will surface a warning asking the user if they'd like to save their changes).
The initial Figma mockup of the /products
page (inspired by Linear's issues filtering features):
This repository's git workflow is inspired from traditional gitflow:
prod
is the production branch that is deployed to production on push;main
is the default development branch that is deployed to staging on push;<username>/<issue>
are the feature and bug fix branches that we use for day-to-day development (see below).
To work on something and deploy to staging:
- make sure there is a Linear ticket for it;
- create a new branch
<username>/<issue>
(e.g.nicholas/ns-1660
or, if you prefer more readable branch names,nicholas/data-loading
); - commit changes to that branch early and often, following the conventional commit message format;
- create a new PR to merge your branch into
main
and request the desired reviewers (at least one must be a code owner; - once that PR is merged into
main
, your changes will be deployed to staging and Linear will automatically close the corresponding ticket.
To deploy to production:
- merge
main
intoprod
.
This site was bootstrapped using the Remix Blues Stack. Learn more about Remix Stacks.
npx create-remix@latest --template remix-run/blues-stack
- Multi-region Fly app deployment with Docker
- Multi-region Fly PostgreSQL Cluster
- Healthcheck endpoint for Fly backups region fallbacks
- GitHub Actions for deploy on merge to production and staging environments
- Email/Password Authentication with cookie-based sessions
- Database ORM with Prisma
- Styling with Tailwind
- End-to-end testing with Cypress
- Local third party request mocking with MSW
- Unit testing with Vitest and Testing Library
- Code formatting with Prettier
- Linting with ESLint
- Static Types with TypeScript
Not a fan of bits of the stack? Fork it, change it, and use npx create-remix --template your/repo
! Make it your own.
Click this button to create a Gitpod workspace with the project set up, Postgres started, and Fly pre-installed
-
This step only applies if you've opted out of having the CLI install dependencies for you:
npx remix init
-
Start the Postgres Database in Docker:
pnpm docker
Note: The pnpm script will complete while Docker sets up the container in the background. Ensure that Docker has finished and your container is running before proceeding.
-
Initial setup:
pnpm run setup
-
Run the first build:
pnpm build
-
Start dev server:
pnpm dev
This starts your app in development mode, rebuilding assets on file changes.
The database seed script creates a new user with some data you can use to get started:
- Email:
[email protected]
- Password:
racheliscool
If you'd prefer not to use Docker, you can also use Fly's Wireguard VPN to connect to a development database (or even your production database). You can find the instructions to set up Wireguard here, and the instructions for creating a development database here.
This is a pretty simple note-taking app, but it's a good example of how you can build a full stack app with Prisma and Remix. The main functionality is creating users, logging in and out, and creating and deleting notes.
- creating users, and logging in and out ./app/models/user.server.ts
- user sessions, and verifying them ./app/session.server.ts
- creating, and deleting notes ./app/models/note.server.ts
This Remix Stack comes with two GitHub Actions that handle automatically deploying your app to production and staging environments.
Prior to your first deployment, you'll need to do a few things:
-
Sign up and log in to Fly
fly auth signup
Note: If you have more than one Fly account, ensure that you are signed into the same account in the Fly CLI as you are in the browser. In your terminal, run
fly auth whoami
and ensure the email matches the Fly account signed into the browser. -
Create two apps on Fly, one for staging and one for production:
fly apps create nicholas-eng fly apps create nicholas-eng-staging
Note: Once you've successfully created an app, double-check the
fly.toml
file to ensure that theapp
key is the name of the production app you created. This Stack automatically appends a unique suffix at init which may not match the apps you created on Fly. You will likely see 404 errors in your Github Actions CI logs if you have this mismatch. -
Initialize Git.
git init
-
Create a new GitHub Repository, and then add it as the remote for your project. Do not push your app yet!
git remote add origin <ORIGIN_URL>
-
Add a
FLY_API_TOKEN
to your GitHub repo. To do this, go to your user settings on Fly and create a new token, then add it to your repo secrets with the nameFLY_API_TOKEN
. -
Specify the
OPTIMIZE_IMAGES
feature flag, to do this you can run the following commands:fly secrets set OPTIMIZE_IMAGES=true --app nicholas-eng fly secrets set OPTIMIZE_IMAGES=false --app nicholas-eng-staging
Note: Enabling this feature flag will dramatically increase CPU/RAM load; your application may OOM and you may incur unexpectedly high compute costs (~$100 for two
shared-cpu-1x
on Fly.io for the additional RAM, shared CPU seconds, and SSD storage required for image optimization).It is recommended to disable this flag in staging and only enable it in production if your deployment target supports it (e.g. you'll probably want to scale up from
shared-cpu-1x
to prevent OOMs) and you are willing to pay the extra compute costs for the slight performance boost to the end-user. -
Add a
SESSION_SECRET
to your fly app secrets, to do this you can run the following commands:fly secrets set SESSION_SECRET=$(openssl rand -hex 32) --app nicholas-eng fly secrets set SESSION_SECRET=$(openssl rand -hex 32) --app nicholas-eng-staging
Note: When creating the staging secret, you may get a warning from the Fly CLI that looks like this:
WARN app flag 'nicholas-eng-staging' does not match app name in config file 'nicholas-eng'
This simply means that the current directory contains a config that references the production app we created in the first step. Ignore this warning and proceed to create the secret.
If you don't have openssl installed, you can also use 1password to generate a random secret, just replace
$(openssl rand -hex 32)
with the generated secret. -
Create a database for both your staging and production environments. Run the following:
fly postgres create --name nicholas-eng-db fly postgres attach --app nicholas-eng nicholas-eng-db fly postgres create --name nicholas-eng-staging-db fly postgres attach --app nicholas-eng-staging nicholas-eng-staging-db
Note: You'll get the same warning for the same reason when attaching the staging database that you did in the
fly set secret
step above. No worries. Proceed!
Fly will take care of setting the DATABASE_URL
secret for you.
Now that everything is set up you can commit and push your changes to your repo. Every commit to your main
branch will trigger a deployment to your production environment, and every commit to your dev
branch will trigger a deployment to your staging environment.
If you run into any issues deploying to Fly, make sure you've followed all of the steps above and if you have, then post as many details about your deployment (including your app name) to the Fly support community. They're normally pretty responsive over there and hopefully can help resolve any of your deployment issues and questions.
Once you have your site and database running in a single region, you can add more regions by following Fly's Scaling and Multi-region PostgreSQL docs.
Make certain to set a PRIMARY_REGION
environment variable for your app. You can use [env]
config in the fly.toml
to set that to the region you want to use as the primary region for both your app and database.
Install the ModHeader browser extension (or something similar) and use it to load your app with the header fly-prefer-region
set to the region name you would like to test.
You can check the x-fly-region
header on the response to know which region your request was handled by.
We use GitHub Actions for continuous integration and deployment. Anything that gets into the main
branch will be deployed to production after running tests/build/etc. Anything in the dev
branch will be deployed to staging.
We use Cypress for our End-to-End tests in this project. You'll find those in the cypress
directory. As you make changes, add to an existing file or create a new file in the cypress/e2e
directory to test your changes.
We use @testing-library/cypress
for selecting elements on the page semantically.
To run these tests in development, run pnpm test:e2e:dev
which will start the dev server for the app as well as the Cypress client. Make sure the database is running in docker as described above.
We have a utility for testing authenticated features without having to go through the login flow:
cy.login()
// you are now logged in as a new user
We also have a utility to auto-delete the user at the end of your test. Just make sure to add this in each test file:
afterEach(() => {
cy.cleanupUser()
})
That way, we can keep your local db clean and keep your tests isolated from one another.
For lower level tests of utilities and individual components, we use vitest
. We have DOM-specific assertion helpers via @testing-library/jest-dom
.
This project uses TypeScript. It's recommended to get TypeScript set up for your editor to get a really great in-editor experience with type checking and auto-complete. To run type checking across the whole project, run pnpm typecheck
.
This project uses ESLint for linting. That is configured in .eslintrc.js
.
We use Prettier for auto-formatting in this project. It's recommended to install an editor plugin (like the VSCode Prettier plugin) to get auto-formatting on save. There's also a pnpm format
script you can run to format all files in the project.