Draft:Gretel AI

{{Paid contributions|date=March 2025}}

{{AFC submission|d|corp|u=Mckornfield|ns=118|decliner=Gheus|declinets=20250303224847|small=yes|ts=20241227141205}}

{{AFC submission|d|corp|u=Mckornfield|ns=118|decliner=WeirdNAnnoyed|declinets=20241215135408|small=yes|ts=20241209193126}}

{{AFC comment|1=You can expand the draft based on these articles: [https://techcrunch.com/2021/10/07/gretel-ai-raises-50m-for-a-platform-that-lets-engineers-build-and-use-synthetic-datasets-to-ensure-the-privacy-of-their-actual-data/], [https://fortune.com/2024/06/13/gretel-ai-startup-synthetic-data/], [https://www.ft.com/content/053ee253-820e-453a-a1d5-0f24985258de] Gheus (talk) 22:48, 3 March 2025 (UTC)}}

{{AFC comment|1=I looked through the referenced papers and there's more than one reference for sure, two of them are explicitly about Gretel, one uses it as a comparison, and I added a fourth. If you think any are passing references, then let me know and I'll remove them. I also removed the link to Gretel's own website as a reference. Mckornfield}}

{{AFC comment|1=The sources cited in this article are either connected to the subject (Gretel's own website), trivial passing mentions, or about other topics entirely. We need significant coverage about the company in secondary, reliable sources to have an article. If such sources can be found the article can be re-created. WeirdNAnnoyed (talk) 13:54, 15 December 2024 (UTC)}}

----

{{Short description|Synthetic Data Generation Company}}

{{Draft topics|internet-culture|software|computing|technology}}

{{AfC topic|org}}

{{AFC comment|1=I looked through the referenced papers and there's more than one reference for sure, two of them are explicitly about Gretel, one uses it as a comparison, and I added a fourth. If you think any are passing references, then let me know and I'll remove them. I also removed the link to Gretel's own website as a reference. Mckornfield}}

{{Infobox website

| name = Gretel

| logo =

| location = San Diego, California, US

| founder = {{hlist|Ali Golshan|Alexander Watson|John Myers}}

| CEO = Ali Golshan{{cite news|url=https://odsc.com/blog/speaker/ali-golshan/|title=Ali Golshan|website=Open Data Science Conference|date=9 December 2024 |access-date=2024-12-09}}

| industry = Software

| url = {{URL|https://gretel.ai/}}

| foundation = {{Start date and age|Jan 2020}}

| area_served = Global

| num_employees = 50-100

}}

{{Infobox software

| title =

| name = Gretel AI

| developer = Gretel Labs

| released = {{Start date and age|2020|3|31}}

| ver layout = stacked

| license = [https://github.com/gretelai/gretel-python-client SDK] - Apache 2.0, [https://github.com/gretelai/gretel-synthetics Synthetics] - Source-available software

| platform = Amazon Web Services, Microsoft Azure, Google Cloud Platform

| programming language = Python

}}

Gretel (also known as Gretel Labs or Gretel AI) is a software startup focused around creating high quality and private Synthetic data. Its primary focus is on generating textual, JSON or tabular data. It accomplishes this using a mix of privacy preservation tools (transformations, differential privacy) in concert with data generation tools (Large language models, and custom Fine-tuning (deep learning)).

Gretel's quality enforcement is accomplished by performing quality checks during data generation, thereby reducing the amount of low quality data in the final dataset.

This type of enforcement can also apply to privacy concerns, by using privacy filters or introducing appropriate levels of noise during data generation.

Gretel's Open Source Datasets

Gretel has released a set of open source datasets (licensed under Apache 2.0) on Hugging Face.{{cite web |title=gretelai (Gretel.ai) |url=https://huggingface.co/gretelai|website=Hugging Face|date=30 October 2024 |access-date=9 December 2024 |archive-date=26 November 2024 |archive-url=https://web.archive.org/web/20241126022430/https://huggingface.co/gretelai |url-status=live }}

These datasets reflect what can be created using Gretel itself, as well as to allow for use in training models, creating tools, or building other sorts of AI systems.

Gretel in Research

Gretel's synthetics offering and platform have been referenced in a few research/comparison articles. Examples include:

  • Performance Analysis of Real and Synthetic Data using Supervised ML Algorithms for Prediction of Chronic Kidney Disease{{cite book |

author=M, Gayathri Hegde and Shenoy, P Deepa and R, Venugopal K|

chapter=Performance Analysis of Real and Synthetic Data using Supervised ML Algorithms for Prediction of Chronic Kidney Disease|

title=2022 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT)|

year=2022|

pages=1–6|

doi=10.1109/CONECCT55679.2022.9865722|

isbn=978-1-6654-9781-7}}

  • Gretel.ai: Open-Source Artificial Intelligence Tool To Generate New Synthetic Data{{cite journal |vauthors=Noruzman A, Ghani N, Zulkifli N|date=2021 |title=Gretel.ai: Open-Source Artificial Intelligence Tool To Generate New Synthetic Data.|url=http://myjieas.psa.edu.my/index.php/myjieas/article/view/27|journal=Malaysian Journal of Innovation in Engineering and Applied Social Sciences|volume=1|issue=1 |pages= |doi= |pmc= |pmid= |access-date=9 December 2024}}
  • Experiments in Reducing NLP Bias and Identifiability for Large LMs{{cite journal |vauthors=Herrera J, Bernal D |journal=TheEyeCorpus |title=Experiments in Reducing NLP Bias and Identifiability for Large LMs.}}
  • Performance Analysis of an Indoor LoRaWAN Network with Field Measurements and AI-Assisted Data Generation{{cite journal| vauthors=Nas A, Yildiz O, Karlik S| title=Performance Analysis of an Indoor LoRaWAN Network with Field Measurements and AI-Assisted Data Generation|journal=ICONSAD'23 3rd International Congress on Scientific Advances| location = Balikesir, Turkey| year=2023| pages=1 }}

References

{{reflist}}