H-1B Dashboard

Welcome to the H-1B Dashboard, a tool for viewing and analyzing H-1B data.

Built and maintained by Ryan Ostrom. DM for requests or questions.


I. For Analysts

1. Source of Data

  • Data is sourced directly from the Department of Labor for each fiscal quarter (2023-current).
  • Quarterly data is ingested into a database to enable efficient querying.

2. Business Logic

  • Scope: Includes aggregate data for all approved jobs in the dataset (currently, 2023-2024).
  • Approval Criteria: Queries include only jobs with:
    • Case Status: "Certified" or "Certified - Withdrawn" (approved positions).
    • Wage Unit of Pay: "Yearly" (excludes hourly pay for data consistency).
  • Job Count: Queries sum "Total Worker Positions" across approved records. A single record can represent multiple positions (e.g., 10).
  • Normalization: Near-duplicate values such as "Tesla, Inc." and "Tesla Inc." (including variants with extra whitespace) are merged for consistency.
  • Integrity: Other than the adjustments stated above, data is not modified.
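
The rules above can be sketched in TypeScript as follows. This is a minimal illustration, not the repository's actual query code: the `LcaRecord` shape, the function names, and the exact normalization steps (dropping commas, collapsing whitespace, uppercasing) are assumptions made for the example.

```typescript
// Hypothetical record shape. Field names mirror the column labels quoted
// above; the real schema in this repo may differ.
interface LcaRecord {
  caseStatus: string;
  wageUnitOfPay: string;
  employerName: string;
  totalWorkerPositions: number;
}

const APPROVED_STATUSES = new Set(["Certified", "Certified - Withdrawn"]);

// A record is treated as approved only if it is certified and paid yearly.
function isApproved(record: LcaRecord): boolean {
  return (
    APPROVED_STATUSES.has(record.caseStatus) &&
    record.wageUnitOfPay === "Yearly"
  );
}

// Assumed normalization: replace commas with spaces, collapse runs of
// whitespace, trim, and uppercase.
function normalizeEmployer(name: string): string {
  return name.replace(/,/g, " ").replace(/\s+/g, " ").trim().toUpperCase();
}

// Sum "Total Worker Positions" per normalized employer over approved records.
function countJobsByEmployer(records: LcaRecord[]): Map<string, number> {
  const totals = new Map<string, number>();
  for (const record of records) {
    if (!isApproved(record)) continue;
    const key = normalizeEmployer(record.employerName);
    totals.set(key, (totals.get(key) ?? 0) + record.totalWorkerPositions);
  }
  return totals;
}
```

Under these assumptions, "Tesla, Inc." and "Tesla  Inc." both map to the same key, so their positions are added together, while denied or hourly records are skipped.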

3. Definitions

Term           | Definition
---------------|-----------
Company        | Defined by "Employer Name".
Job Title      | Defined by "SOC Title" (a normalized descriptor, preferred over "Job Title").
State          | Defined by "Workplace State" (the actual work location, not the employer HQ).
Salary         | Based on "Wage Rate Of Pay From" (the minimum salary value).
Highest Paying | Jobs with the highest percentage of compensation in the >$250k bracket.
Lowest Paying  | Jobs with the highest percentage of compensation in the <$50k bracket.
Most Jobs      | Jobs with the highest count of "Total Worker Positions" in a given segment (e.g., Company, State).
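
To make the "Highest Paying" and "Lowest Paying" definitions concrete, here is one possible way to compute them in TypeScript. It is a sketch under stated assumptions: the `JobRow` shape and function names are hypothetical, and the share is weighted by "Total Worker Positions" rather than by record count, which the definitions above do not specify.

```typescript
// Hypothetical row shape built from the columns named in the definitions.
interface JobRow {
  socTitle: string;
  wageRateOfPayFrom: number; // yearly minimum salary in USD
  totalWorkerPositions: number;
}

// For each SOC title, the percentage of positions whose minimum salary
// falls in the bracket described by `inBracket`.
function bracketShareByTitle(
  rows: JobRow[],
  inBracket: (salary: number) => boolean,
): Map<string, number> {
  const total = new Map<string, number>();
  const hits = new Map<string, number>();
  for (const row of rows) {
    total.set(row.socTitle, (total.get(row.socTitle) ?? 0) + row.totalWorkerPositions);
    if (inBracket(row.wageRateOfPayFrom)) {
      hits.set(row.socTitle, (hits.get(row.socTitle) ?? 0) + row.totalWorkerPositions);
    }
  }
  const share = new Map<string, number>();
  total.forEach((count, title) => {
    share.set(title, count === 0 ? 0 : (100 * (hits.get(title) ?? 0)) / count);
  });
  return share;
}

// Titles ordered from the largest share to the smallest.
function rankTitles(share: Map<string, number>): string[] {
  return Array.from(share.entries())
    .sort((a, b) => b[1] - a[1])
    .map(([title]) => title);
}
```

With these helpers, a "Highest Paying" list would be `rankTitles(bracketShareByTitle(rows, (s) => s > 250000))`, and a "Lowest Paying" list would use `(s) => s < 50000` instead.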

II. For Developers

A. Installation

1. Clone Repo

$ git clone git@github.com:anon-r-7/h1b.git

2. Set Env Vars

Prerequisite: direnv

$ cp .envrc.example .envrc
# set env vars accordingly

3. Set Seed Data

i. Download Quarterly Files
ii. Set quarterly files
  • Move files to packages/api/db/utils/data/.
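  • Note: File names should follow the pattern LCA_Disclosure_Data_FY<YYYY>_Q<N>.xlsx, where <YYYY> is the fiscal year and <N> is the quarter (for example, LCA_Disclosure_Data_FY2023_Q1.xlsx).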
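
A quick way to check file names before seeding is sketched below. It assumes the pattern means "FY" followed by a four-digit fiscal year and "Q" followed by a quarter from 1 to 4, as in the Department of Labor's published file names. `parseQuarterFileName` and `QuarterFile` are hypothetical names for this example, not part of the repository.

```typescript
// Hypothetical parsed form of a quarterly file name.
interface QuarterFile {
  fiscalYear: number;
  quarter: number;
}

// Assumed pattern: LCA_Disclosure_Data_FY<4-digit year>_Q<1-4>.xlsx
const QUARTERLY_FILE = /^LCA_Disclosure_Data_FY(\d{4})_Q([1-4])\.xlsx$/;

// Returns the fiscal year and quarter, or null if the name does not match.
function parseQuarterFileName(fileName: string): QuarterFile | null {
  const match = QUARTERLY_FILE.exec(fileName);
  if (match === null) return null;
  return { fiscalYear: Number(match[1]), quarter: Number(match[2]) };
}
```

A name that returns `null` here would likely need renaming before it is moved into the data directory.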

B. Run Locally

$ docker compose up

Note: On the first run, migrations convert the spreadsheets into database records. This may take several minutes.

C. Dev Tools

1. Lint Fix

$ yarn lint:fix
$ yarn prettier:fix

2. Lint Check

$ yarn tsc
$ yarn lint
$ yarn prettier

D. Contributing

$ git checkout main
$ git pull origin main
$ git checkout -b feature/<feature>
$ git push origin feature/<feature>
Create PR from feature branch to main

III. Disclaimer

This application aims to provide accurate H-1B data based on publicly available datasets. It does not endorse or certify any company or job role listed, and it makes no guarantee that the data is accurate, complete, or free of errors.
