Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Thanks this is way better than existing sites.

I did notice that searching for Google gives you 5 different entities that are all Google.



Yes I am still cleaning the data. It's all unbelievable the amount of crap you can find in the public records. I have a lot of duplicates unfortunately.

Down the road, I'm planning on providing a couple of cool charts/map about trends and evolutions.


A few things I'd be interested in:

- ratio of LCAs to total permanent full time employees. This will show which companies are really leaning on the H1-B visa (I'd estimate my "household name" software company is at about 50%)

- Source of prevailing wage. Interestingly employers don't have to use the BOL published data and can self report. I'd be interested to see how many self report.

- The average delta between prevailing wage and salary per employer/job title.

- A way to search by geographic region.


Excellent ideas. Hope to see these things soon.


Check out Open Refine. Has a feature that clusters similar strings and unifies. I remember last time I looked at this data set... 4 letter acronyms spelled 12 different ways, it's unbelievably messy.


For search result of a company name, can you sort the results descending by # of LCA? this would put the most likely true result of top, I think.

I also noticed that the sort buttons (for salary) behave backwards of what I'd intuitively expect?

Would it be possible to auto-merge likely duplicate company names?

Great job, keep it up!! :)


I'm currently coding both feature. :)



Any plans to open-source this on Github and accept pull requests?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: