### 1. Load Pandas in console and load csv data file
    import pandas as pd

    data = pd.read_csv(&quot;data.csv&quot;, sep = &quot;,&quot;)

### 2. Examine first few rows of data

    data.head() 

### 3. Calculate summary statistics

    summary = data.describe()

### 4. Transpose statistics to get similar format as R summary() function

    summary = summary.transpose()

### 5. Visualize summary statistics in console

    summary.head()



**No**. You&#39;ll need to use `pandas`.

R is for language for statistics, so many of the basic functionality you need, like `summary()` and `lm()`, are loaded when you boot it up. Python has many uses, so you need to install and import the appropriate statistical packages. `numpy` isn&#39;t a statistics package - it&#39;s for numerical computation more generally, so you need to use packages like `pandas`, `scipy` and `statsmodels` to allow Python to do what R can do out of the box.

If you are looking for details like summary() in R i.e 

 - 5 point summary for numeric variables    
 - Frequency of occurrence of each class for categorical variable

To achieve above in Python you can use df.describe(include= &#39;all&#39;).

I have in my .gradle folder, a 2.4 folder which is the version of gradle.
I want to downgrade to 2.2.1, because I need to use Gradle plugin 1.0.1.
I already try to change by:

&gt; distributionUrl=https\://services.gradle.org/distributions/gradle-2.2.1-all.zip

But this did not solve the issue and I&#39;m still with 2.4 version.

How can I solve this?

How to downgrade to older version of Gradle

I want to fire the JQuery `change` event when the input text is changed programmatically, for example like this:

&lt;!-- begin snippet: js hide: false console: true babel: false --&gt;

&lt;!-- language: lang-js --&gt;

    $(&quot;input&quot;).change(function(){
        console.log(&quot;Input text changed!&quot;);
    });
    $(&quot;input&quot;).val(&quot;A&quot;);

&lt;!-- language: lang-html --&gt;

    &lt;script src=&quot;https://ajax.googleapis.com/ajax/libs/jquery/2.1.1/jquery.min.js&quot;&gt;&lt;/script&gt;
    &lt;input type=&#39;text&#39; /&gt;

&lt;!-- end snippet --&gt;

But it doesn&#39;t work. How can I make this work? 
    
    

How to fire JQuery change event when input value changed programmatically?

Is there an equivalent of `R`&#39;s `summary()` function in `numpy`?

`numpy` has std, mean, average functions separately, but does it have a function that sums up everything, like `summary` does in `R`?

If found [this][1] question which relates to `pandas` and [this][2] article with R-to-numpy equivalents, but it doesn&#39;t have what I seek for.


  [1]: https://stackoverflow.com/questions/27637281/what-are-python-pandas-equivalents-for-r-functions-like-str-summary-and-he
  [2]: http://mathesaurus.sourceforge.net/r-numpy.html

R summary() equivalent in numpy

Is there an equivalent of <code>R</code>'s <code>summary()</code> function in <code>numpy</code>?
<code>numpy</code> has std, mean, average functions separately, but does it have a function that sums up everything, like <code>summary</code> does in <code>R</code>?
If found <a href="https://stackoverflow.com/questions/27637281/what-are-python-pandas-equivalents-for-r-functions-like-str-summary-and-he" target="_blank" rel="noopener noreferrer">this</a> question which relates to <code>pandas</code> and <a href="http://mathesaurus.sourceforge.net/r-numpy.html" target="_blank" rel="noopener noreferrer">this</a> article with R-to-numpy equivalents, but it doesn't have what I seek for.

I am using TensorFlow to train a neural network. This is how I am initializing the `GradientDescentOptimizer`:

    init = tf.initialize_all_variables()
    sess = tf.Session()
    sess.run(init)

    mse        = tf.reduce_mean(tf.square(out - out_))
    train_step = tf.train.GradientDescentOptimizer(0.3).minimize(mse)

The thing here is that I don&#39;t know how to set an update rule for the learning rate or a decay value for that. 

How can I use an adaptive learning rate here?

    

How to set adaptive learning rate for GradientDescentOptimizer?

I am reading some example codes in Tensorflow, I found following code 

    flags = tf.app.flags
    FLAGS = flags.FLAGS
    flags.DEFINE_float(&#39;learning_rate&#39;, 0.01, &#39;Initial learning rate.&#39;)
    flags.DEFINE_integer(&#39;max_steps&#39;, 2000, &#39;Number of steps to run trainer.&#39;)
    flags.DEFINE_integer(&#39;hidden1&#39;, 128, &#39;Number of units in hidden layer 1.&#39;)
    flags.DEFINE_integer(&#39;hidden2&#39;, 32, &#39;Number of units in hidden layer 2.&#39;)
    flags.DEFINE_integer(&#39;batch_size&#39;, 100, &#39;Batch size.  &#39;
                     &#39;Must divide evenly into the dataset sizes.&#39;)
    flags.DEFINE_string(&#39;train_dir&#39;, &#39;data&#39;, &#39;Directory to put the training data.&#39;)
    flags.DEFINE_boolean(&#39;fake_data&#39;, False, &#39;If true, uses fake data &#39;
                     &#39;for unit testing.&#39;)

 in `tensorflow/tensorflow/g3doc/tutorials/mnist/fully_connected_feed.py`

But I can&#39;t find any docs about this usage of `tf.app.flags`. 

And I found the implementation of this flags is in the 
[`tensorflow/tensorflow/python/platform/default/_flags.py`][1]

Obviously, this `tf.app.flags` is somehow used to configure a network, so why  is it not in the API docs? Can anyone explain what is going on here? 


  [1]: https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/platform/default/_flags.py

What&#39;s the purpose of tf.app.flags in TensorFlow?

I&#39;m trying to construct a simple function that takes a subplot instance (`matplotlib.axes._subplots.AxesSubplot`) and transforms its projection to another projection, for example, to one of the `cartopy.crs.CRS` projections.

The idea looks something like this

    import cartopy.crs as ccrs
    import matplotlib.pyplot as plt

    def make_ax_map(ax, projection=ccrs.PlateCarree()):
        # set ax projection to the specified projection
        ...
        # other fancy formatting
        ax2.coastlines()
        ...
    
    # Create a grid of plots
    fig, (ax1, ax2) = plt.subplots(ncols=2)
    # the first subplot remains unchanged
    ax1.plot(np.random.rand(10))
    # the second one gets another projection
    make_ax_map(ax2)
    
Of course, I can just use `fig.add_subplot()` function:

    fig = plt.figure(figsize=(10,5))
    ax1 = fig.add_subplot(121)
    ax1.plot(np.random.rand(10))

    ax2 = fig.add_subplot(122,projection=ccrs.PlateCarree())
    ax2.coastlines()

but I was wondering if there is a proper `matplotlib` method to change a subplot axis projection *after* it was defined. Reading matplotlib API didn&#39;t help unfortunately.

How do I change matplotlib&#39;s subplot projection of an existing axis?

I have a function in python that can either return a `bool` or a `list`. Is there a way to specify the return types using type hints?

For example, is this the correct way to do it?

    def foo(id) -&gt; list or bool:
        ...

How to specify multiple return types using type-hints

I would like to use _batch normalization_ in TensorFlow. I found the related C++ source code in [`core/ops/nn_ops.cc`][1]. However, I did not find it documented on tensorflow.org.

BN has different semantics in MLP and CNN, so I am not sure what exactly this BN does.

I did not find a method called `MovingMoments` either.

  [1]: https://github.com/tensorflow/tensorflow/blob/master/tensorflow/core/ops/nn_ops.cc

How could I use batch normalization in TensorFlow?

Using the following code I got the data I wanted, but for some reason I can&#39;t figure out `knitr` doesn&#39;t let me compile a PDF document, as shown further below:

My code:

    install.packages(&quot;weatherData&quot;)
    library(weatherData)
    istanbul &lt;- getWeatherForDate(&quot;Istanbul&quot;,
                                  start_date = Sys.Date() - 41, 
                                  end_date = Sys.Date())

Works out with no problem but I get the following message trying compile the PDF:

    Quitting from lines 3-31 (ist_weather.spin.Rmd) 
    Error in contrib.url(repos, type) : 
      trying to use CRAN without setting a mirror
    Calls: &lt;Anonymous&gt; ... eval -&gt; eval -&gt; install.packages -&gt; grep -&gt; contrib.url
    Execution halted

install.packages fails in knitr document: &quot;trying to use CRAN without setting a mirror&quot;

I&#39;ve seen similar questions on Stack Overflow but virtually no conclusive answers, and certainly no answer that worked for me.

What is the easiest way to access and use objects (regression fits, data frames, other objects) that are located in the global R environment in the Markdown (Rstudio) script.

I find it surprising that there is no easy solution to this, given the tendency of the RStudio team to make things comfortable and effective.

Thanks in advance.

How to use objects from global environment in Rstudio Markdown

I&#39;m in the process of trying out a dplyr-based workflow (rather than using mostly data.table, which I&#39;m used to), and I&#39;ve come across a problem that I can&#39;t find an equivalent dplyr solution to. I commonly run into the scenario where I need to conditionally update/replace several columns based on a single condition. Here&#39;s some example code, with my data.table solution: 

    library(data.table)

    # Create some sample data
    set.seed(1)
    dt &lt;- data.table(site = sample(1:6, 50, replace=T),
                     space = sample(1:4, 50, replace=T),
                     measure = sample(c(&#39;cfl&#39;, &#39;led&#39;, &#39;linear&#39;, &#39;exit&#39;), 50, 
                                   replace=T),
                     qty = round(runif(50) * 30),
                     qty.exit = 0,
                     delta.watts = sample(10.5:100.5, 50, replace=T),
                     cf = runif(50))
    
    # Replace the values of several columns for rows where measure is &quot;exit&quot;
    dt &lt;- dt[measure == &#39;exit&#39;, 
             `:=`(qty.exit = qty,
                  cf = 0,
                  delta.watts = 13)]

Is there a simple dplyr solution to this same problem? I&#39;d like to avoid using ifelse because I don&#39;t want to have to type the condition multiple times - this is a simplified example, but there are sometimes many assignments based on a single condition. 

Thanks in advance for the help!

dplyr mutate/replace several columns on a subset of rows

**update: Brandon Bertelsen&#39;s answer**:


Brandon&#39;s answer produces the following output. 
It doesn&#39;t produce nice tables or highlight code like Rstudio does, and it crashes on some html files with unicode, so I&#39;m not using it to automate my email reports. 

My current approach is to compile with Rstudio to html, open the html document in chrome, and then copy and paste the html document into gmail. This works pretty well, see this gist: https://gist.github.com/nelsonauner/a68b5a808c232ce7817e


 [![enter image description here][1]][1]


Original question:
========


Is there an easy way to send an R markdown document as the body of an email, so that the body of the email looks similar to the results of using Rstudio&#39;s &quot;Knit HTML&quot; ?  

Here&#39;s a basic reproducible example using `knitr`, `rmarkdown`, and `mailR`
## example.Rmd

    ---
    title: &quot;Report for email&quot;
    output: 
      html_document: 
        self_contained: no
    ---
    
    ```{r}
    summary(cars)  
    ```
    
    You can also embed plots, for example:
    
    ```{r, echo=FALSE}
    plot(cars)
    ```

I&#39;m using `self_contained: no` since the default base64 encoding does not work with `mailR` (recommended by Yihui in [this SO post](https://stackoverflow.com/questions/32520928/in-r-is-there-any-way-to-send-an-rmarkdown-v2-html-file-as-the-body-of-an-email))


## knit_and_send.R
    # compile using rmarkdown
    library(rmarkdown)
    rmarkdown::render(&quot;example.Rmd&quot;)
    
    library(mailR)
    
    send.mail(from = &quot;me@gmail.com&quot;,
              to = &quot;me@gmail.com&quot;,
              subject = &quot;R Markdown Report - rmarkdown&quot;,
              html = T,
              inline = T,
              body = &quot;example.html&quot;,
              smtp = list(host.name = &quot;smtp.gmail.com&quot;, port = 465, user.name = &quot;me&quot;, passwd = &quot;password&quot;, ssl = T),
              authenticate = T,
              send = T)

    #compile using knitr
    library(knitr)
    knit2html(&quot;example.Rmd&quot;,options=&quot;&quot;)
    
    send.mail(from = &quot;me@gmail.com&quot;,
              to = &quot;me@gmail.com&quot;,
              subject = &quot;R Markdown Report - knitr&quot;,
              html = T,
              inline = T,
              body = &quot;example.html&quot;,
              smtp = list(host.name = &quot;smtp.gmail.com&quot;, port = 465, user.name = &quot;me&quot;, passwd = &quot;password&quot;, ssl = T),
              authenticate = T,
              send = T)


Both emails send successfully. 

The knitted email looks like this: 


-----

[![knitted and emailed report][2]][2]

-----

and the rmarkdown email looks like this. (Notice that it also includes a bunch of javascript files--I think I&#39;d have to write some scripts to remove them)

-------

[![enter image description here][3]][3]

----------

But neither of them look as nice as the report that is produced from Rstudio&#39;s &quot;Knit as HTML&quot;, which looks like this: 

[![enter image description here][4]][4]



Any suggestions? 


I think a true fix might involve some postprocessing of the html file that incorporate the css styling in an email-friendly way while removing the javascript files. 

For now, I&#39;ll use the `knitr` package. 

Please let me know if something isn&#39;t clear and I&#39;ll improve the question. 


Relevant SO posts: 

https://stackoverflow.com/questions/32520928/in-r-is-there-any-way-to-send-an-rmarkdown-v2-html-file-as-the-body-of-an-email

https://stackoverflow.com/questions/24346856/mailr-how-to-send-rmarkdown-documents-as-body-in-email


  [1]: http://i.stack.imgur.com/YNuCw.png
  [2]: http://i.stack.imgur.com/83khP.png
  [3]: http://i.stack.imgur.com/avVaD.png
  [4]: http://i.stack.imgur.com/3eGNm.png

How to send R markdown report in body of email?

I am quite new to R.  

Using the table called `SE_CSVLinelist_clean`, I want to extract the rows where the Variable called `where_case_travelled_1` DOES NOT contain the strings `&quot;Outside Canada&quot;` OR `&quot;Outside province/territory of residence but within Canada&quot;`.  Then create a new table called `SE_CSVLinelist_filtered`.

    SE_CSVLinelist_filtered &lt;- filter(SE_CSVLinelist_clean, 
    where_case_travelled_1 %in% -c(&#39;Outside Canada&#39;,&#39;Outside province/territory of residence but within Canada&#39;))

The code above works when I just use &quot;c&quot; and not &quot;-c&quot;.  
So, how do I specify the above when I really want to exclude rows that contains that outside of the country or province?



How to specify &quot;does not contain&quot; in dplyr filter

I want to create a numpy array in which each element must be a list, so later I can append new elements to each.

I have looked on google and here on stack overflow already, yet it seems nowhere to be found.

Main issue is that numpy assumes your list must become an array, but that is not what I am looking for.



How to create a numpy array of lists?

How to convert a tensor into a numpy array when using Tensorflow with Python bindings?

Convert a tensor to numpy array in Tensorflow?

I recently moved to Python 3.5 and noticed the [new matrix multiplication operator (@)][1] sometimes behaves differently from the [numpy dot][2] operator. In example, for 3d arrays:

    import numpy as np
    
    a = np.random.rand(8,13,13)
    b = np.random.rand(8,13,13)
    c = a @ b  # Python 3.5+
    d = np.dot(a, b)

The `@` operator returns an array of shape:

    c.shape
    (8, 13, 13)

while the `np.dot()` function returns:

    d.shape
    (8, 13, 8, 13)

How can I reproduce the same result with numpy dot? Are there any other significant differences?


  [1]: https://docs.python.org/3/whatsnew/3.5.html#whatsnew-pep-465
  [2]: http://docs.scipy.org/doc/numpy/reference/generated/numpy.dot.html

Difference between numpy dot() and Python 3.5+ matrix multiplication @

I need to fit `RandomForestRegressor` from `sklearn.ensemble`.

    forest = ensemble.RandomForestRegressor(**RF_tuned_parameters)
    model = forest.fit(train_fold, train_y)
    yhat = model.predict(test_fold)

This code always worked until I made some preprocessing of data (`train_y`).
The error message says:

&gt;    DataConversionWarning: A column-vector y was passed when a 1d array was expected. Please change the shape of y to (n_samples,), for example using ravel().
&gt;
&gt;    model = forest.fit(train_fold, train_y)

Previously `train_y` was a Series, now it&#39;s numpy array (it is a column-vector). If I apply `train_y.ravel()`, then it becomes a row vector and no error message appears, through the prediction step takes very long time (actually it never finishes...).

In the docs of `RandomForestRegressor` I found that `train_y` should be defined as `y : array-like, shape = [n_samples] or [n_samples, n_outputs]`
Any idea how to solve this issue?




A column-vector y was passed when a 1d array was expected

In order to find the index of the smallest value, I can use `argmin`: 

    import numpy as np
    A = np.array([1, 7, 9, 2, 0.1, 17, 17, 1.5])
    print A.argmin()     # 4 because A[4] = 0.1

&lt;/p&gt;
But how can I find the indices of the **k-smallest values**?

I&#39;m looking for something like:

    print A.argmin(numberofvalues=3)   
    # [4, 0, 7]  because A[4] &lt;= A[0] &lt;= A[7] &lt;= all other A[i]

&lt;/p&gt;

*Note: in my use case A has between ~ 10 000 and 100 000 values, and I&#39;m interested for only the indices of the k=10 smallest values. k will never be &gt; 10.*

Content Type	Original Author	Original Content on Stackoverflow
Question	iulian	View Question on Stackoverflow
Solution 1 - Python	Thomas Hepner	View Answer on Stackoverflow
Solution 2 - Python	Eoin	View Answer on Stackoverflow
Solution 3 - Python	SKB	View Answer on Stackoverflow

R summary() equivalent in numpy

Python Problem Overview

Python Solutions

Solution 1 - Python

1. Load Pandas in console and load csv data file

2. Examine first few rows of data

3. Calculate summary statistics

4. Transpose statistics to get similar format as R summary() function

5. Visualize summary statistics in console

Solution 2 - Python

Solution 3 - Python

How to fire JQuery change event when input value changed programmatically?

How to downgrade to older version of Gradle

Attributions